Categories:
Audio (13)
Biotech (29)
Bytecode (36)
Database (77)
Framework (7)
Game (7)
General (507)
Graphics (53)
I/O (35)
IDE (2)
JAR Tools (101)
JavaBeans (21)
JDBC (121)
JDK (426)
JSP (20)
Logging (108)
Mail (58)
Messaging (8)
Network (84)
PDF (97)
Report (7)
Scripting (84)
Security (32)
Server (121)
Servlet (26)
SOAP (24)
Testing (54)
Web (15)
XML (309)
Collections:
Other Resources:
iText kernel.jar Source Code
kernel.jar is a component in iText Java library to provide low-level functionalities. iText Java library allows you to generate and manage PDF documents.
The Source Code files are provided together with the JAR file in the binary packge like iText7-Core-7.1.4.zip. You can download it at iText 7 Core Download site.
You can compile it to generate your JAR file, using kernel.pom as the build configuration file.
The source code of kernel-7.1.4.jar is provided below:
✍: FYIcenter.com
⏎ com/itextpdf/kernel/xmp/impl/Latin1Converter.java
//Copyright (c) 2006, Adobe Systems Incorporated //All rights reserved. // // Redistribution and use in source and binary forms, with or without // modification, are permitted provided that the following conditions are met: // 1. Redistributions of source code must retain the above copyright // notice, this list of conditions and the following disclaimer. // 2. Redistributions in binary form must reproduce the above copyright // notice, this list of conditions and the following disclaimer in the // documentation and/or other materials provided with the distribution. // 3. All advertising materials mentioning features or use of this software // must display the following acknowledgement: // This product includes software developed by the Adobe Systems Incorporated. // 4. Neither the name of the Adobe Systems Incorporated nor the // names of its contributors may be used to endorse or promote products // derived from this software without specific prior written permission. // // THIS SOFTWARE IS PROVIDED BY ADOBE SYSTEMS INCORPORATED ''AS IS'' AND ANY // EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED // WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE // DISCLAIMED. IN NO EVENT SHALL ADOBE SYSTEMS INCORPORATED BE LIABLE FOR ANY // DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES // (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; // LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND // ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT // (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS // SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. // // http://www.adobe.com/devnet/xmp/library/eula-xmp-library-java.html package com.itextpdf.kernel.xmp.impl; import java.io.UnsupportedEncodingException; /** * @since 12.10.2006 */ public class Latin1Converter { /** */ private static final int STATE_START = 0; /** */ private static final int STATE_UTF8CHAR = 11; /** * Private constructor */ private Latin1Converter() { // EMPTY } /** * A converter that processes a byte buffer containing a mix of UTF8 and Latin-1/Cp1252 chars. * The result is a buffer where those chars have been converted to UTF-8; * that means it contains only valid UTF-8 chars. * <p> * <em>Explanation of the processing:</em> First the encoding of the buffer is detected looking * at the first four bytes (that works only if the buffer starts with an ASCII-char, * like xmls '<'). UTF-16/32 flavours do not require further processing. * <p> * In the case, UTF-8 is detected, it assumes wrong UTF8 chars to be a sequence of * Latin-1/Cp1252 encoded bytes and converts the chars to their corresponding UTF-8 byte * sequence. * <p> * The 0x80..0x9F range is undefined in Latin-1, but is defined in Windows code * page 1252. The bytes 0x81, 0x8D, 0x8F, 0x90, and 0x9D are formally undefined * by Windows 1252. These are in XML's RestrictedChar set, so we map them to a * space. * <p> * The official Latin-1 characters in the range 0xA0..0xFF are converted into * the Unicode Latin Supplement range U+00A0 - U+00FF. * <p> * <em>Example:</em> If an Euro-symbol (€) appears in the byte buffer (0xE2, 0x82, 0xAC), * it will be left as is. But if only the first two bytes are appearing, * followed by an ASCII char a (0xE2 - 0x82 - 0x41), it will be converted to * 0xC3, 0xA2 (â) - 0xE2, 0x80, 0x9A (‚) - 0x41 (a). * * @param buffer a byte buffer contain * @return Returns a new buffer containing valid UTF-8 */ public static ByteBuffer convert(ByteBuffer buffer) { if ("UTF-8".equals(buffer.getEncoding())) { // the buffer containing one UTF-8 char (up to 8 bytes) byte[] readAheadBuffer = new byte[8]; // the number of bytes read ahead. int readAhead = 0; // expected UTF8 bytesto come int expectedBytes = 0; // output buffer with estimated length ByteBuffer out = new ByteBuffer(buffer.length() * 4 / 3); int state = STATE_START; for (int i = 0; i < buffer.length(); i++) { int b = buffer.charAt(i); switch (state) { default: case STATE_START: if (b < 0x7F) { out.append((byte) b); } else if (b >= 0xC0) { // start of UTF8 sequence expectedBytes = -1; int test = b; for (; expectedBytes < 8 && (test & 0x80) == 0x80; test = test << 1) { expectedBytes++; } readAheadBuffer[readAhead++] = (byte) b; state = STATE_UTF8CHAR; } else // implicitly: b >= 0x80 && b < 0xC0 { // invalid UTF8 start char, assume to be Latin-1 byte[] utf8 = convertToUTF8((byte) b); out.append(utf8); } break; case STATE_UTF8CHAR: if (expectedBytes > 0 && (b & 0xC0) == 0x80) { // valid UTF8 char, add to readAheadBuffer readAheadBuffer[readAhead++] = (byte) b; expectedBytes--; if (expectedBytes == 0) { out.append(readAheadBuffer, 0, readAhead); readAhead = 0; state = STATE_START; } } else { // invalid UTF8 char: // 1. convert first of seq to UTF8 byte[] utf8 = convertToUTF8(readAheadBuffer[0]); out.append(utf8); // 2. continue processing at second byte of sequence i = i - readAhead; readAhead = 0; state = STATE_START; } break; } } // loop ends with "half" Utf8 char --> assume that the bytes are Latin-1 if (state == STATE_UTF8CHAR) { for (int j = 0; j < readAhead; j++) { byte b = readAheadBuffer[j]; byte[] utf8 = convertToUTF8(b); out.append(utf8); } } return out; } else { // Latin-1 fixing applies only to UTF-8 return buffer; } } /** * Converts a Cp1252 char (contains all Latin-1 chars above 0x80) into a * UTF-8 byte sequence. The bytes 0x81, 0x8D, 0x8F, 0x90, and 0x9D are * formally undefined by Windows 1252 and therefore replaced by a space * (0x20). * * @param ch * an Cp1252 / Latin-1 byte * @return Returns a byte array containing a UTF-8 byte sequence. */ private static byte[] convertToUTF8(byte ch) { int c = ch & 0xFF; try { if (c >= 0x80) { if (c == 0x81 || c == 0x8D || c == 0x8F || c == 0x90 || c == 0x9D) { return new byte[] { 0x20 }; // space for undefined } // interpret byte as Windows Cp1252 char return new String(new byte[] { ch }, "cp1252").getBytes("UTF-8"); } } catch (UnsupportedEncodingException e) { // EMPTY } return new byte[] { ch }; } }
⏎ com/itextpdf/kernel/xmp/impl/Latin1Converter.java
Or download all of them as a single archive file:
File name: kernel-7.1.4-sources.jar File size: 1201221 bytes Release date: 2018-10-09 Download
⇐ Download and Install iText7-Core-7.1.4.zip
2010-02-18, 85395👍, 0💬
Popular Posts:
The Apache FontBox library is an open source Java tool to obtain low level information from font fil...
What Is HttpComponents httpcore-4.2.2.jar? HttpComponents httpcore-4.2.2.jar is the JAR file for Apa...
JDK 11 jdk.jfr.jmod is the JMOD file for JDK 11 JFR module. JDK 11 JFR module compiled class files a...
The Apache FontBox library is an open source Java tool to obtain low level information from font fil...
How to download and install mysql-connector-j-8.0.31 .zip?Connector/J Java library is a JDBC Driver ...