Data.Text.Lazy.Encoding
| Copyright | (c) 2009 2010 Bryan O'Sullivan |
|---|---|
| License | BSD-style |
| Maintainer | [email protected] |
| Portability | portable |
| Safe Haskell | Trustworthy |
| Language | Haskell2010 |
Description
Functions for converting lazy Text values to and from lazy ByteString, using several standard encodings.
To gain access to a much larger family of encodings, use the text-icu package.
Decoding ByteStrings to Text
All of the single-parameter functions for decoding bytestrings encoded in one of the Unicode Transformation Formats (UTF) operate in a strict mode: each will throw an exception if given invalid input.
Each function has a variant, whose name is suffixed with -With, that gives greater control over the handling of decoding errors. For instance, decodeUtf8 will throw an exception, but decodeUtf8With allows the programmer to determine what to do on a decoding error.
decodeASCII :: ByteString -> Text Source
Deprecated: Use decodeUtf8 instead
Deprecated. Decode a ByteString containing 7-bit ASCII encoded text.
decodeLatin1 :: ByteString -> Text Source
Decode a ByteString containing Latin-1 (aka ISO-8859-1) encoded text.
decodeUtf8 :: ByteString -> Text Source
Decode a ByteString containing UTF-8 encoded text that is known to be valid.
If the input contains any invalid UTF-8 data, an exception will be thrown that cannot be caught in pure code. For more control over the handling of invalid data, use decodeUtf8' or decodeUtf8With.
decodeUtf16LE :: ByteString -> Text Source
Decode text from little endian UTF-16 encoding.
If the input contains any invalid little endian UTF-16 data, an exception will be thrown. For more control over the handling of invalid data, use decodeUtf16LEWith.
decodeUtf16BE :: ByteString -> Text Source
Decode text from big endian UTF-16 encoding.
If the input contains any invalid big endian UTF-16 data, an exception will be thrown. For more control over the handling of invalid data, use decodeUtf16BEWith.
decodeUtf32LE :: ByteString -> Text Source
Decode text from little endian UTF-32 encoding.
If the input contains any invalid little endian UTF-32 data, an exception will be thrown. For more control over the handling of invalid data, use decodeUtf32LEWith.
decodeUtf32BE :: ByteString -> Text Source
Decode text from big endian UTF-32 encoding.
If the input contains any invalid big endian UTF-32 data, an exception will be thrown. For more control over the handling of invalid data, use decodeUtf32BEWith.
Catchable failure
decodeUtf8' :: ByteString -> Either UnicodeException Text Source
Decode a ByteString containing UTF-8 encoded text..
If the input contains any invalid UTF-8 data, the relevant exception will be returned, otherwise the decoded text.
Note: this function is not lazy, as it must decode its entire input before it can return a result. If you need lazy (streaming) decoding, use decodeUtf8With in lenient mode.
Controllable error handling
decodeUtf8With :: OnDecodeError -> ByteString -> Text Source
Decode a ByteString containing UTF-8 encoded text.
decodeUtf16LEWith :: OnDecodeError -> ByteString -> Text Source
Decode text from little endian UTF-16 encoding.
decodeUtf16BEWith :: OnDecodeError -> ByteString -> Text Source
Decode text from big endian UTF-16 encoding.
decodeUtf32LEWith :: OnDecodeError -> ByteString -> Text Source
Decode text from little endian UTF-32 encoding.
decodeUtf32BEWith :: OnDecodeError -> ByteString -> Text Source
Decode text from big endian UTF-32 encoding.
Encoding Text to ByteStrings
encodeUtf8 :: Text -> ByteString Source
Encode text using UTF-8 encoding.
encodeUtf16LE :: Text -> ByteString Source
Encode text using little endian UTF-16 encoding.
encodeUtf16BE :: Text -> ByteString Source
Encode text using big endian UTF-16 encoding.
encodeUtf32LE :: Text -> ByteString Source
Encode text using little endian UTF-32 encoding.
encodeUtf32BE :: Text -> ByteString Source
Encode text using big endian UTF-32 encoding.
Encoding Text using ByteString Builders
encodeUtf8Builder :: Text -> Builder Source
Encode text to a ByteString Builder using UTF-8 encoding.
Since: text-1.1.0.0
encodeUtf8BuilderEscaped :: BoundedPrim Word8 -> Text -> Builder Source
Encode text using UTF-8 encoding and escape the ASCII characters using a BoundedPrim.
Use this function is to implement efficient encoders for text-based formats like JSON or HTML.
Since: text-1.1.0.0
© The University of Glasgow and others
Licensed under a BSD-style license (see top of the page).
https://downloads.haskell.org/~ghc/8.10.2/docs/html/libraries/text-1.2.3.2/Data-Text-Lazy-Encoding.html