path: root/src/UniConversion.cxx
Age | Commit message | Author | Files | Lines
2024-02-28 | Add variant of UTF8Classify that takes a char* so that client code does not have to reinterpret_cast. Make functions in header constexpr. Prefer .data() to &[] since safer. Avoid else when not needed. | Neil | 1 | -15/+17
2022-08-13 | Reduce warnings with noexcept, fewer casts, and other minor changes. | Neil | 1 | -2/+2
2021-05-24 | Define C++ version of the Scintilla API in ScintillaTypes.h, ScintillaMessages.h and ScintillaStructures.h using scoped enumerations. Use these headers instead of Scintilla.h internally. External definitions go in the Scintilla namespace and internal definitions in Scintilla::Internal. | Neil | 1 | -3/+1
2020-06-11 | Use noexcept where safe and maintainable. | Neil | 1 | -1/+1
2019-03-20 | Use noexcept where sensible. Rename UTF8 string_view parameters for clarity. | Neil | 1 | -29/+29
2019-03-20 | Implement WStringFromUTF8 to simplify code that creates wstring objects for regular expressions and calling the Win32 API. | Neil | 1 | -0/+25
2018-07-10 | Optional indexing of line starts in UTF-8 documents by UTF-32 code points and UTF-16 code units added. | Neil | 1 | -0/+16
2018-06-01 | Add function to find a UTF-16 position in a UTF-8 string. | Neil | 1 | -0/+12
2018-05-14 | Use string_view for UniConversion functions. | Neil | 1 | -31/+31
2018-05-14 | Include <string_view> to allow future use in Platform interface and Unicode. | Neil | 1 | -0/+1
2018-04-27 | Avoid reinterpret_cast. Use size_t argument to UTF8Classify to avoid casts. | Neil | 1 | -24/+22
2018-04-22 | Remove casts between char and unsigned char where possible. | Neil | 1 | -2/+1
2018-04-21 | Tighten definition of regular expression iterators so they are noexcept and define all the standard member functions. This cascades to all methods called by the iterators, affecting Document, CellBuffer, Partitioning, SplitVector and UTF-8 and DBCS functions. Other trivial functions declared noexcept. | Neil | 1 | -3/+3
2018-03-25 | Removed comment about script that is not included. | Neil | 1 | -1/+0
2018-03-25 | Feature [feature-requests:#1213]. Clarify UTF8Classify. Use UTF8BytesOfLead to determine expected length early in function to quickly detect argument not long enough, invalid single bytes and invalid first trail then branching on length for more detailed checks. | Zufu Liu | 1 | -45/+42
2018-03-24 | Removed unreachable branches in UTF8Classify. | Neil | 1 | -8/+0
2018-03-24 | Feature [feature-requests:#1212]. Move Unicode conversions into UniConversion. Move Unicode conversion functions UnicodeFromUTF8 and UTF8FromUTF32Character into UniConversion. | Zufu Liu | 1 | -1/+21
2018-03-22 | Feature [feature-requests:#1211]. Use pre-computed table for UTF8BytesOfLead. Friendlier treatment of invalid UTF-8. Add tests for UniConversion handling invalid UTF-8. Simplify UTF8Classify tests. | Zufu Liu | 1 | -84/+96
2018-03-14 | Bug [#2001]. Make masking and comparison code clearer. | Neil | 1 | -16/+25
2017-09-11 | The Scintilla namespace is always active for internal symbols and for the lexer interfaces ILexer4 and IDocument. | Neil | 1 | -6/+0
2017-05-02 | More consistent use of size_t when converting Unicode formats. | Neil | 1 | -9/+9
2017-04-06 | Added const where possible. | Neil | 1 | -5/+5
2017-04-01 | Standardize on C++ headers, remove headers that aren't needed and add <cstddef> where it may be needed in the future. | Neil | 1 | -1/+1
2017-03-02 | Fix potential problems with IME on Cocoa when document contains invalid UTF-8. | Neil | 1 | -0/+23
2015-11-20 | Bug [#1779]. Better Unicode input support on Windows systems. Support surrogate pairs in WM_CHAR messages; support characters from supplementary planes in WM_UNICHAR messages; support WM_UNICHAR messages in non-Unicode mode; fix some code duplication. Also, do not return FALSE upon receiving a WM_UNICHAR message with a UNICODE_NOCHAR parameter, since WM_UNICHAR can actually be handled just fine (at least with the exact same level of support as WM_CHAR). | Sam Hocevar | 1 | -4/+0
2015-07-04 | Make std::exception visible to all core code to remove chance that a different tool set will not indirectly include <stdexcept>. | Neil | 1 | -0/+2
2015-02-24 | Notify container of non-BMP characters correctly. | Neil | 1 | -1/+1
2015-02-23 | Fix non-BMP character entry through the inline IME. | Neil | 1 | -3/+3
2015-01-13 | Using size_t instead of unsigned int for conversions to UTF16 for 64-bit compatibility and to lessen the number of casts. | Neil | 1 | -7/+7
2014-10-02 | Allow using C++11 <regex> for searches as a provisional feature. | Neil | 1 | -0/+46
2013-12-15 | Format normalization - whitespace and braces made consistent. Parameter names added to method declarations. | Neil | 1 | -1/+1
2013-07-22 | Added the character representation feature. | Neil | 1 | -0/+5
2013-07-21 | Standardising header guards and namespaces. | Neil | 1 | -0/+12
2012-05-26 | Reduce code to avoid warning from cppcheck. | nyamatongwe | 1 | -4/+2
2012-05-26 | Optimize UTF-8 character length calculations by using an array. | nyamatongwe | 1 | -0/+28
2012-05-26 | Move classification of UTF-8 byte sequences into UniConversion module. | nyamatongwe | 1 | -0/+91
2010-05-02 | Bug #2995278 minor fixes to typos and types. | nyamatongwe | 1 | -1/+1
2010-03-23 | Added function for finding how many bytes are in a UTF-8 character. | nyamatongwe | 1 | -0/+12
2010-02-17 | Formatting whitespace. | nyamatongwe | 1 | -1/+1
2008-12-23 | Avoid warnings with gcc 4.3 | nyamatongwe | 1 | -1/+1
2007-04-19 | All Unicode planes supported, not just the Basic Multilingual Plane. | nyamatongwe | 1 | -12/+55
2004-06-23 | Delete all markers for a particular marker number now deletes multiple markers on one line. | nyamatongwe | 1 | -1/+1
2001-10-28 | Fixed Platform changes for GTK+ and hid most of the implementation of Surface. | nyamatongwe | 1 | -4/+0
2001-02-24 | Updated documentation comments from Philippe. | nyamatongwe | 1 | -1/+4
2001-01-28 | Updating copyright notices for 2001. | nyamatongwe | 1 | -1/+1
2000-04-10 | Death of Accessor. Birth of UniConversion. | nyamatongwe | 1 | -0/+77