| author | Robin Haberkorn <robin.haberkorn@googlemail.com> | 2024-09-11 14:30:24 +0200 | 
|---|---|---|
| committer | Robin Haberkorn <robin.haberkorn@googlemail.com> | 2024-09-11 16:14:27 +0200 | 
| commit | 2a050759ab621b87d0782cc8235907a1757b46cc (patch) | |
| tree | cde0c666146f833fc948a13c22056061f62d2619 /src/interface-curses/curses-utils.h | |
| parent | 68578072bfaf6054a96bb6bcedfccb6e56a508fe (diff) | |
fixed searches in single-byte encoded documents
* While code is guaranteed to be valid UTF-8, the same cannot be
  said of the result of string building.
* The search pattern can end up containing invalid UTF-8 byte sequences
  even when searching in UTF-8 buffers, e.g. if ^EQq inserts garbage.
  There are currently no checks.
* When searching on a raw buffer, it must be possible to
  search for arbitrary bytes (^EUq).
  Since teco_pattern2regexp() was always expecting clean UTF-8 input,
  this would sometimes skip over too many bytes and could even crash.
* Instead, teco_pattern2regexp() now takes the <S> target codepage
  into account.
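
The failure mode described above can be illustrated with a minimal sketch. This is not the code from this commit: `enum codepage`, `next_char_len()` and `count_chars()` are hypothetical names made up for illustration, and the actual teco_pattern2regexp() may be structured quite differently. The idea is only that a pattern scanner should advance by one byte on single-byte codepages, and should fall back to one byte on UTF-8 input whenever the lead byte promises more bytes than are actually valid.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical codepage tag; not SciTECO's actual type. */
enum codepage { CODEPAGE_UTF8, CODEPAGE_SINGLE_BYTE };

/*
 * Return how many bytes the next pattern character occupies.
 * Hypothetical helper for illustration only.
 */
static size_t
next_char_len(const unsigned char *p, size_t len, enum codepage cp)
{
	size_t n, i;

	assert(len > 0);

	/* On single-byte codepages every byte is a character of its own. */
	if (cp == CODEPAGE_SINGLE_BYTE)
		return 1;

	/* Derive the expected sequence length from the UTF-8 lead byte. */
	if (p[0] < 0x80)
		n = 1;
	else if ((p[0] & 0xE0) == 0xC0)
		n = 2;
	else if ((p[0] & 0xF0) == 0xE0)
		n = 3;
	else if ((p[0] & 0xF8) == 0xF0)
		n = 4;
	else
		return 1; /* stray continuation byte or invalid lead byte */

	/* Never step past the end of the pattern. */
	if (n > len)
		return 1;

	/* All trailing bytes must be continuation bytes (10xxxxxx). */
	for (i = 1; i < n; i++)
		if ((p[i] & 0xC0) != 0x80)
			return 1;

	return n;
}

/* Count the characters of a pattern under the given codepage. */
static size_t
count_chars(const unsigned char *p, size_t len, enum codepage cp)
{
	size_t count = 0;

	while (len > 0) {
		size_t n = next_char_len(p, len, cp);
		p += n;
		len -= n;
		count++;
	}

	return count;
}
```

A scanner that trusted the lead byte unconditionally would treat the Latin-1 byte 0xE4 followed by "ab" as a single three-byte character and skip the two following pattern bytes. The sketch treats each of those bytes as its own character instead.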
Diffstat (limited to 'src/interface-curses/curses-utils.h')
0 files changed, 0 insertions, 0 deletions
