sciteco - Scintilla-based Text Editor and COrrector

Age	Commit message	Author	Files	Lines
2024-12-13	fixed lexer.checkheader: restore dot in case of success	Robin Haberkorn	1	-1/+1

2024-12-13	fixup 244a54a18b7db6af177c9d10f3224772f08d7484: abuse the Scintilla view's "identifier" to enable lexing in the container	Robin Haberkorn	3	-11/+13
	* SCI_SETILEXER(NULL) is not a reliable way to do that since that's the default for all views. * This was breaking the git.tes lexer for instance and was unnecessarily driving teco_lexer_style() on plain-text documents. * Since we currently do not implement the ILexer5 C++ interface and teco_view_t is just a pointer alias, we are abusing the view's "identifier" instead. This is probably sufficient, as long as there is only one lexer "in the container". Otherwise, there should perhaps be a single C++ class that does nothing but wrap a callback into an ILexer5 object with a C ABI.
2024-12-13	document the FK...$^SR idiom	Robin Haberkorn	3	-2/+3
	* We don't actually have to negate ^S results after FK. For deleting the matched pattern, you can use ^YD or -^SD.
2024-12-13	updated README: mention community resources	Robin Haberkorn	1	-0/+10

2024-12-13	updated INSTALL: how to install dependencies on Fedora and FreeBSD	Robin Haberkorn	1	-0/+10

2024-12-13	implemented Scintilla lexer for SciTECO code, i.e. TECO syntax highlighting	Robin Haberkorn	19	-27/+362
	* This works by embedding the SciTECO parser and driving it always (exclusively) in parse-only mode. * A new teco_state_t::style determines the Scintilla style for any character accepted in the given state. * Therefore, the SciTECO lexer is always 100% exact and corresponds to the current SciTECO grammar - it does not have to be maintained separately. There are a few exceptions and tweaks, though. * The contents of curly-brace escapes (`@^Uq{...}`) are rendered as ordinary code using a separate parser instance. This can be disabled with the lexer.sciteco.macrodef property. Unfortunately, SciTECO does not currently allow setting lexer properties (FIXME). * Labels and comments are currently styled the same. This could change in the future once we introduce real comments. * Lexers are usually implemented in C++, but I did not want to draw in C++. Especially not since we'd have to include parser.h and other SciTECO headers that we really do not want to keep C++-compatible. Instead, the lexer is implemented "in the container". @ES/SCI_SETILEXER/sciteco/ is internally translated to SCI_SETILEXER(NULL) and we get Scintilla notifications when styling the view becomes necessary. This is then centrally forwarded to teco_lexer_style(), which uses the ordinary teco_view_ssm() API for styling. * Once the command line becomes a Scintilla view even on Curses, we can enable syntax highlighting of the command line macro.
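	The idea of deriving styles from parser states can be sketched in a few lines of C. This is only an illustration under simplifying assumptions: the three states, the style constants and style_text() are hypothetical and far smaller than the real teco_state_t machinery. Each character receives the style of the state that accepts it, so the highlighting follows the grammar by construction.

```c
#include <stddef.h>

/* Hypothetical styles and parser states, for illustration only. */
enum { STYLE_DEFAULT = 0, STYLE_NUMBER = 1, STYLE_LABEL = 2 };
enum state { ST_START, ST_NUMBER, ST_LABEL };

/* One style per state, analogous in spirit to a per-state style field. */
static const unsigned char style_of[] = {
	STYLE_DEFAULT,	/* ST_START */
	STYLE_NUMBER,	/* ST_NUMBER */
	STYLE_LABEL	/* ST_LABEL */
};

/*
 * Drives the toy parser over text in "parse-only" fashion and records,
 * for every byte, the style of the state that accepted it.
 * Labels are delimited by '!' and digits form numbers.
 */
void style_text(const char *text, size_t len, unsigned char *styles)
{
	enum state s = ST_START;

	for (size_t i = 0; i < len; i++) {
		char c = text[i];
		enum state accepting;

		if (s == ST_LABEL) {
			/* Inside a label; the closing '!' still belongs to it. */
			accepting = ST_LABEL;
			s = (c == '!') ? ST_START : ST_LABEL;
		} else if (c == '!') {
			accepting = s = ST_LABEL;
		} else if (c >= '0' && c <= '9') {
			accepting = s = ST_NUMBER;
		} else {
			accepting = s = ST_START;
		}

		styles[i] = style_of[accepting];
	}
}
```

	In a container lexer, a function of this shape would be called from the notification handler for the range that needs styling, and the resulting styles would be applied to the view.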
2024-12-10	updated NEWS: mentioned new release	Robin Haberkorn	1	-0/+3

2024-12-10	fixed compiler warnings when building release builds	Robin Haberkorn	1	-2/+2
	* g_assert() apparently does not reference the expression when assertions are disabled in contrast to glibc's assert()
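	The kind of warning described here can be illustrated without GLib. The following is a minimal sketch, assuming an assertion macro that drops its expression entirely when disabled; SKETCH_ASSERT, store() and store_checked() are hypothetical names, and the actual fix in the commit may look different.

```c
#include <stdlib.h>

/* Hypothetical assertion macro; when disabled, the expression vanishes. */
#ifdef SKETCH_DISABLE_ASSERT
#define SKETCH_ASSERT(expr) ((void)0)
#else
#define SKETCH_ASSERT(expr) ((expr) ? (void)0 : abort())
#endif

/* Stand-in for an operation that reports success as 0. */
static int store(int *slot, int value)
{
	*slot = value;
	return 0;
}

int store_checked(int *slot, int value)
{
	int rc = store(slot, value);

	SKETCH_ASSERT(rc == 0);
	/*
	 * Without this cast, rc would be set but never read when
	 * SKETCH_DISABLE_ASSERT is defined, which compilers report
	 * as an unused variable.
	 */
	(void)rc;

	return *slot;
}
```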
2024-12-09	updated Debian package for v2.2.0 release	Robin Haberkorn	1	-0/+6

2024-12-09	updated FreeBSD package to v2.2.0	Robin Haberkorn	3	-34/+36

2024-12-09	prepared v2.2.0 release (tag: v2.2.0)	Robin Haberkorn	3	-31/+107
	This release is mainly for the upcoming FreeBSD package.
2024-12-08	fixed rubbing out file open with glob patterns	Robin Haberkorn	1	-5/+4
	* This would crash if <EB> opened more than one file, e.g. EB*.c$. The reason is that teco_current_doc_undo_edit() must be called before every teco_ring_edit(). Unfortunately, this is not reproducible with sciteco --no-profile --fake-cmdline '@EB"foo*.txt"{HK}' since the crashes actually happen when printing messages in interactive mode. That's why no test case has been added.
2024-12-08	implemented the ^Q command for converting between line and glyph positions	Robin Haberkorn	4	-7/+76
	* As known from DEC TECO, but extended to convert absolute positions to line numbers as well. :^Q returns the current line. * Especially useful in macros that accept line arguments, as it is much shorter than something like ^E@ES/LINEFROMPOSITION//+Q.l@ES/POSITIONFROMLINE//:^E-. * On the other hand, the fact that ^Q checks the line range means we cannot easily replace lexer.checkheader with something like [:J 0,^Q::S...$ ]: Using SCI_POSITIONFROMLINE still has the advantage that it returns `Z` for out-of-bounds ranges which would be cumbersome to write with the current ^Q. * Perhaps there should be a separate command for converting between absolute lines and positions and :^Q should be repurposed to return a failure boolean for out-of-range values? * fnkeys.tes could be simplified.
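	The two conversions discussed above can be sketched on a plain byte buffer. This is an illustration only: position_from_line() and line_from_position() are hypothetical helpers with 0-based lines and byte offsets, not the ^Q implementation or the Scintilla messages. The sketch clamps out-of-range lines to the end of the buffer, which is the behaviour the commit message finds convenient.

```c
#include <stddef.h>

/*
 * Returns the byte offset at which the given 0-based line starts.
 * Lines beyond the end of the buffer are clamped to len.
 */
size_t position_from_line(const char *buf, size_t len, size_t line)
{
	size_t pos = 0;

	while (line > 0 && pos < len) {
		if (buf[pos++] == '\n')
			line--;
	}

	return line == 0 ? pos : len;
}

/*
 * Returns the 0-based line containing the given byte offset.
 * Offsets beyond the end of the buffer are clamped to len.
 */
size_t line_from_position(const char *buf, size_t len, size_t pos)
{
	size_t line = 0;

	if (pos > len)
		pos = len;
	for (size_t i = 0; i < pos; i++) {
		if (buf[i] == '\n')
			line++;
	}

	return line;
}
```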
2024-12-06	support the ::S anchored search (string comparison) command (and ::FD, ::FR, ::FS as well)	Robin Haberkorn	15	-72/+130
	* The colon modifier can now occur 2 times. Specifying `@` more than once or `:` more than twice is an error now. * Commands do not check for excess colon modifiers - almost every command would have to check it. Instead, a double colon will simply behave like a single colon on most commands. * All search commands inherit the anchored semantics, but it's not very useful in some combinations like -::S, ::N or ::FK. That's why the `::` variants are not documented everywhere. * The lexer.checkheader macro could be simplified and should also be faster now, speeding up startup. Eventually this macro can be made superfluous, e.g. by using 1:FB or 0,1^Q::S.
2024-12-05	nightly builds: use Mac OS 13 instead of the deprecated version 12	Robin Haberkorn	1	-1/+1
	* 13 is now the oldest supported version
2024-12-05	updated NEWS: mention the chatroom poll	Robin Haberkorn	1	-2/+2

2024-12-04	use the new ^Y, ^S and @Xq commands in tedoc.tes and string.tes	Robin Haberkorn	2	-7/+7

2024-12-04	the <Xq> command now supports the @ modifier for cutting into the register	Robin Haberkorn	6	-12/+79
	* Can be freely combined with the colon-modifier as well. :@Xq cut-appends to register q. * This simply deletes the given buffer range after the copy or append operation as if followed by another <K> command. * This has indeed been a very annoying missing feature, as you often have to retype the range for a K or D command. At the same time, this cannot be reasonably solved with a macro since macros do not accept Q-Register arguments -- so we would have to restrict ourselves to one or a few selected registers. I was also considering solving this with a special stack operation that duplicates the top values, so that Xq leaves arguments for K, but this couldn't work for cutting lines and would also be longer to type. * It's the first non-string command that accepts @. Others may follow in the future. We're approaching ITS TECO madness levels.
2024-12-04	implemented ^Y/^S commands for receiving pattern match/insertion ranges and lengths (refs #27)	Robin Haberkorn	8	-25/+224
	* Allows storing pattern matches into Q-Registers (^YXq). * You can also refer to subpatterns marked by ^E[...] by passing a number > 0. This is equivalent to \0-9 references in many programming languages. * It's especially useful for supporting TECO's equivalent of structural regular expressions. This will be done with additional macros. * You can also simply back up to the beginning of an insertion or search. So I...$^SC leaves dot at the beginning of the insertion. S...$^SC leaves dot before the found pattern. This has been previously requested by users. * Perhaps there should be ^Y string building characters as well to backreference in search-replacement commands (TODO). This means that the search commands would have to store the matched text itself in teco_range_t structures since FR deletes the matched text before processing the replacement string. It could also be made into a FR/FS-specific construct, so we don't fetch the substrings unnecessarily. * This differs from DEC TECO in always returning the same range even after dot movements, since we are storing start/end byte positions instead of only the length. Also DEC TECO does not support fetching subpattern ranges.
2024-11-30	sciteco(7): fixed outdated information about the STYLE_CALLTIP default colors	Robin Haberkorn	1	-3/+3

2024-11-30	updated Scinterm to the official v5.2 release	Robin Haberkorn	1	-0/+0
	This does not change anything functionally.
2024-11-25	fixed operator precedence application (fixup 5597bc72671d0128e6f0dba446c4dc8d47bf37d0)	Robin Haberkorn	2	-7/+12
	* Using teco_expressions_eval() is wrong since it does not pay attention to precedences. If you have multiple higher precedence operators in a row, as in 2+3*4*5, the lower precedence operators would be resolved prematurely. * Instead we now call teco_expressions_calc() repeatedly but only for lower precedence operators on the stack top. This makes sure that as much of the expression as possible is evaluated at any given moment.
2024-11-25	avoid dynamic stack allocation in teco_expressions_brace_return()	Robin Haberkorn	1	-4/+1
	* This is not safe since the size of the stack object comes from the "outside" world, so stack overflows can theoretically be provoked by macros.
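	The general hazard can be sketched as follows. This is an assumption-laden illustration, not the code from teco_expressions_brace_return(): copy_top_values() is a hypothetical helper that needs a temporary copy of n values, where n is controlled by the macro being executed. A variable-length array or alloca() would grow the C stack by that amount; a checked heap allocation fails cleanly instead.

```c
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/*
 * Returns a heap-allocated copy of the top n values of a value stack
 * with the given depth, or NULL if n is out of range or memory is
 * exhausted. The caller frees the result.
 *
 * Writing `long tmp[n];` or using alloca() here instead would let an
 * untrusted n decide how far the C stack grows.
 */
long *copy_top_values(const long *stack, size_t depth, size_t n)
{
	long *copy;

	if (n > depth || n > SIZE_MAX / sizeof(long))
		return NULL;

	/* Request at least one byte, since malloc(0) may return NULL. */
	copy = malloc(n > 0 ? n * sizeof(long) : 1);
	if (copy == NULL)
		return NULL;

	memcpy(copy, stack + (depth - n), n * sizeof(long));
	return copy;
}
```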
2024-11-25	fixed subtle operator precedence bug	Robin Haberkorn	2	-1/+7
	* It was possible to provoke operator right-associativity when placing a high-precedence operator between two low-precedence operators. 1-6*5-1 evaluated to -28 instead of the expected -30. The reason is that SciTECO relies on operators being resolved from left to right as soon as possible. The higher-precedence operator prevents that, and pushing the 2nd "-" only evaluated 6*5. At the end, 1-30-1 would be left on the stack. teco_expressions_eval(), however, evaluates from right to left, which is wrong in this case. Instead, we now do a full eval on every operator with a lower precedence, making sure that 1-30 is evaluated first.
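	The intended left-to-right behaviour can be sketched with a miniature evaluator. This is not SciTECO's expression stack: eval_eager() and its helpers are hypothetical, handle only non-negative integers with + - * / and return 0 for malformed input. Before an operator is pushed, every operator on the stack top that binds at least as tightly is resolved, so 1-6*5-1 becomes (1-30)-1.

```c
#include <ctype.h>
#include <stddef.h>
#include <string.h>

enum { MAX_DEPTH = 8 };

static int precedence(char op)
{
	return (op == '*' || op == '/') ? 2 : 1;
}

static long apply(long a, char op, long b)
{
	switch (op) {
	case '+': return a + b;
	case '-': return a - b;
	case '*': return a * b;
	default:  return b != 0 ? a / b : 0; /* '/': yields 0 on division by zero */
	}
}

/* Resolves the operator on the stack top; returns 0 if operands are missing. */
static int reduce(long *nums, size_t *n_nums, const char *ops, size_t *n_ops)
{
	long a, b;

	if (*n_nums < 2 || *n_ops < 1)
		return 0;
	b = nums[--*n_nums];
	a = nums[--*n_nums];
	nums[(*n_nums)++] = apply(a, ops[--*n_ops], b);
	return 1;
}

long eval_eager(const char *expr)
{
	long nums[MAX_DEPTH];
	char ops[MAX_DEPTH];
	size_t n_nums = 0, n_ops = 0;
	const char *p = expr;

	while (*p) {
		if (isdigit((unsigned char)*p)) {
			long v = 0;
			while (isdigit((unsigned char)*p))
				v = v * 10 + (*p++ - '0');
			if (n_nums == MAX_DEPTH)
				return 0;
			nums[n_nums++] = v;
		} else if (strchr("+-*/", *p)) {
			/* Resolve everything that binds at least as tightly first. */
			while (n_ops > 0 && precedence(ops[n_ops - 1]) >= precedence(*p)) {
				if (!reduce(nums, &n_nums, ops, &n_ops))
					return 0;
			}
			ops[n_ops++] = *p++;
		} else {
			return 0;
		}
	}

	while (n_ops > 0) {
		if (!reduce(nums, &n_nums, ops, &n_ops))
			return 0;
	}
	return n_nums == 1 ? nums[0] : 0;
}
```

	Reducing only the single operator on the stack top when the second "-" arrives, and resolving the rest from right to left at the end, would compute 1-(30-1) instead.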
2024-11-24	sciteco(7): minor documentation fix	Robin Haberkorn	1	-1/+1

2024-11-24	lexer.auto: use case-sensitive searches	Robin Haberkorn	1	-0/+0
	* lexer.checkheader is therefore case-sensitive now as well
2024-11-24	added special Q-Register ":" for accessing dot	Robin Haberkorn	9	-15/+76
	* We cannot call it "." since that introduces a local register and we don't want to add an unnecessary syntactic exception. * Allows the idiom [: ... ]: to temporarily move around. Also, you can now write ^E\: without having to store dot in a register first. * In the future we might add an ^E register as well for byte offsets. However, there are much fewer useful applications. * Of course, you can now also write nU: instead of nJ, Q: instead of "." and n%: instead of "nC.". However it's all not really useful.
2024-11-24	minor documentation changes: use typographic quotes instead of "	Robin Haberkorn	2	-3/+3

2024-11-23	string building: ^c (caret+c) no longer expands to data garbage for non-control characters, but to the literal caret, followed by c	Robin Haberkorn	1	-0/+9
	* For instance `^$` would insert two characters. * The alternative would have been to throw an error.
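	A minimal sketch of this expansion rule, assuming that the characters with a control-code equivalent are `@` through `_` after folding lowercase letters to uppercase. expand_caret() is a hypothetical helper, not the string-building code itself.

```c
#include <stddef.h>

/*
 * Expands the two-character sequence ^c.
 * Writes either the corresponding control code (1 byte) or the
 * literal caret followed by c (2 bytes) to out and returns the
 * number of bytes written.
 */
size_t expand_caret(char c, char out[2])
{
	unsigned char u = (unsigned char)c;

	if (u >= 'a' && u <= 'z')
		u -= 'a' - 'A';	/* ^a is treated like ^A */

	if (u >= '@' && u <= '_') {
		out[0] = (char)(u ^ 0x40);	/* e.g. 'A' (0x41) becomes 0x01 */
		return 1;
	}

	/* No control-code equivalent: keep both characters verbatim. */
	out[0] = '^';
	out[1] = c;
	return 2;
}
```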
2024-11-23	disallow setting the radix to values lower than 2	Robin Haberkorn	3	-8/+34
	* This would actually cause crashes when trying to format numbers. * The ^R local register has a custom set_integer() method now, so that the check is also performed when using nU.^X.
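	Why small radix values are dangerous can be seen in any digit-by-digit formatter. The following is a sketch, not SciTECO's formatting code: format_radix() is hypothetical and the upper limit of 36 is an assumption of the sketch. With a radix of 0 the loop would divide by zero, and with a radix of 1 the magnitude would never shrink.

```c
#include <stddef.h>

/*
 * Formats value in the given radix into buf (NUL-terminated).
 * Returns the number of characters written, or -1 if the radix is
 * outside 2..36 or buf is too small.
 */
int format_radix(long value, int radix, char *buf, size_t size)
{
	static const char digits[] = "0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ";
	char tmp[sizeof(long) * 8 + 2];	/* binary digits plus sign */
	size_t n = 0;
	unsigned long mag;

	/* The check that guards the division loop below. */
	if (radix < 2 || radix > 36)
		return -1;

	mag = value < 0 ? 0UL - (unsigned long)value : (unsigned long)value;
	do {
		tmp[n++] = digits[mag % (unsigned long)radix];
		mag /= (unsigned long)radix;
	} while (mag != 0);
	if (value < 0)
		tmp[n++] = '-';

	if (n + 1 > size)
		return -1;
	/* Digits were produced least significant first; reverse them. */
	for (size_t i = 0; i < n; i++)
		buf[i] = tmp[n - 1 - i];
	buf[n] = '\0';

	return (int)n;
}
```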
2024-11-23	the search mode and current radix are mapped to __local__ Q-Registers ^X and ^R now (refs #17)	Robin Haberkorn	12	-56/+185
	* This way the search mode and radix are local to the current macro frame, unless the macro was invoked with :Mq. If colon-modified, you can reproduce the same effect by calling [.^X 0^X ... ].^X * The radix register is cached in the Q-Reg table as an optimization. This could be done with the other "special" registers as well, but at the cost of larger stack frames. * In order to allow constructs like [.^X typed with upcarets, the Q-Register specification syntax has been extended: ^c is the corresponding control code instead of the register "^".
2024-11-23	implemented search mode flag (^X): allow case-sensitive searches (closes #17)	Robin Haberkorn	5	-6/+41
	* Usually you will only want -^X for enabling case sensitive searches and 0^X for case-insensitive searches (which is also the default). * An open question is what happens if the user sets -^X and then calls a macro. The search mode flag should probably be stacked away along with the search-string. This means we'd need a ^X special Q-Reg as well, so you can write [^X[_ 0^X S...$ ]_]^X. Alternatively, the search mode flag should be a property of the macro frame, along with the radix.
2024-11-19	minor documentation fixes	Robin Haberkorn	2	-3/+5
	* also explicitly mention -%q
2024-11-18	fixed some common typos: "ie." and "eg.", "ocur" instead of "occur"	Robin Haberkorn	14	-36/+36

2024-11-18	Debian/Ubuntu, FreeBSD and website updates for v2.1.1 release	Robin Haberkorn	4	-5/+11

2024-11-17	updated TODO and ChangeLog for v2.1.1 release (tag: v2.1.1)	Robin Haberkorn	2	-1/+62

2024-11-11	session.tes: store the current tab style (width and hard-tabs); fixed for filenames containing ASCII 27	Robin Haberkorn	2	-0/+5
	* You can now set a per-file tab style that differs from the defaults established in the ED hook. This is important especially since we do not yet support per-project .teco_ini scripts where you could establish differing policies depending on the VCS repository. (The latter would be easy to implement, but we cannot currently easily extend the existing ED hooks.) * It's unlikely that files contain an ASCII 27, but not impossible. Therefore we now use ASCII 0 (^@) as a terminator. This should indeed be safe under UNIX. Even better would be a string building construct for escaping ASCII 27 ($), though, as that would work with arbitrary bytes.
2024-11-10	Win32: fixed Unicode commandlines with newer MinGW runtimes	Robin Haberkorn	3	-5/+18
	* should also fix Win32 nightly builds * Even though we weren't using main's argv, but were using glib API for retrieving the command line in UTF-8, newer MinGW runtimes would fail when converting the Unicode command line into the system codepage would be lossy. * Most people seem to compile in a "manifest" to work around this issue. But this requires newer Windows versions and using some Microsoft tool which isn't even in $PATH. Instead, we now link with -municode and define wmain() instead, even though we still ignore argv. wmain() probably gets the command line in UTF-16 and we'd have to convert it anyway. * See https://github.com/msys2/MINGW-packages/issues/22462
2024-11-10	fixed installation with absolute "scitecodatadir" (i.e. if building non-relocatable binary)	Robin Haberkorn	1	-2/+2
	* This was accidentally installing into $bindir/usr (usually /usr/local/bin/usr/...). You might want to check whether SciTECO accidentally installed something there.
2024-11-10	changed target release to v2.1.1	Robin Haberkorn	1	-1/+1
	The next release will include almost exclusively bug fixes.
2024-11-10	updated grosciteco.tes(1): mention new macros, changed command lines and restrictions	Robin Haberkorn	2	-1/+19

2024-11-10	updated TODO	Robin Haberkorn	2	-4/+24

2024-11-10	grosciteco: basic support for GNU pic graphics	Robin Haberkorn	1	-26/+52
	* The line drawing algorithm currently works only with tbl, though. * Also only straight lines are currently supported. * This was meant for rendering presentations in SciTECO - it's not currently used or planned to be used in the manpages. Although we might well add pic graphics to the manpages in the future.
2024-11-10	grosciteco: support .SCITECO_STARTSTYLING and .SCITECO_SETSTYLING macros	Robin Haberkorn	2	-1/+21
	This could theoretically be used to apply Scintilla styles not natively and easily supported by grosciteco, e.g. different fonts and font sizes.
2024-11-10	grosciteco: added some more quote glyphs	Robin Haberkorn	1	-0/+2

2024-11-07	if a macro ends without finding a goto label, always throw a 'Label "..." not found' error	Robin Haberkorn	1	-7/+7
	* This is important with gotos in loops as in <@O/x/>, where we would otherwise get a confusing "Unterminated loop" error. * This in particular fixes the error thrown in grosciteco.tes when encountering a new unknown command.
2024-11-07	grosciteco: fixed the CuXXXX postprocessor command, i.e. insertion of characters by Unicode code point, i.e. typesetting of most non-Latin text	Robin Haberkorn	1	-1/+1
	* This was broken at least for characters that happened to contain hexadecimal digits > 9 since "D does not detect hexadecimal digits.
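	The class of bug is easy to reproduce in C, where isdigit() plays the role of a decimal-only digit test. The following sketch assumes glyph names of the form uXXXX with a single code point; parse_unicode_name() is a hypothetical helper and composite names are rejected.

```c
#include <ctype.h>

/*
 * Parses a glyph name of the form "uXXXX" (hexadecimal code point).
 * Returns the code point, or -1 if the name is malformed or out of
 * the Unicode range.
 */
long parse_unicode_name(const char *name)
{
	long cp = 0;

	if (name[0] != 'u' || !isxdigit((unsigned char)name[1]))
		return -1;

	for (const char *p = name + 1; *p; p++) {
		int c = toupper((unsigned char)*p);

		/*
		 * Testing with isdigit() here would stop at the first
		 * of A-F and mis-parse names such as "u00E9".
		 */
		if (!isxdigit(c))
			return -1;
		cp = cp * 16 + (isdigit(c) ? c - '0' : c - 'A' + 10);
		if (cp > 0x10FFFF)
			return -1;
	}

	return cp;
}
```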
2024-11-07	grosciteco: fixed styling of multi-byte text (i.e. non-Latin characters)	Robin Haberkorn	1	-1/+1
	* We just passed the length in glyphs to SCI_SETSTYLING.
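	The difference between the two lengths can be sketched as follows, assuming valid UTF-8 and taking "glyph" to mean one Unicode code point. utf8_glyphs() is a hypothetical helper. Scintilla positions and lengths are byte-based, so a glyph count is too short as soon as the text contains multi-byte characters.

```c
#include <stddef.h>

/*
 * Counts the code points in a UTF-8 string of len_bytes bytes by
 * counting every byte that is not a continuation byte (10xxxxxx).
 */
size_t utf8_glyphs(const char *s, size_t len_bytes)
{
	size_t glyphs = 0;

	for (size_t i = 0; i < len_bytes; i++) {
		if (((unsigned char)s[i] & 0xC0) != 0x80)
			glyphs++;
	}

	return glyphs;
}
```

	For the three-byte string consisting of CYRILLIC CAPITAL LETTER ZHE (0xD0 0x96) followed by "a", this returns 2, whereas a styling call has to cover all 3 bytes.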
2024-11-07	test suite: fixed failure detection in the commandline-editing test cases	Robin Haberkorn	3	-3/+17
	* The program exit code will usually not signal failures since they are caught earlier. * Therefore, we always have to capture and check stderr.
2024-11-06	fixed the Q-Reg spec machine used for implementing S^EGq$ (match one of characters in Q-Register)	Robin Haberkorn	4	-21/+33
	* It was initialized only once, so it could inherit the wrong local Q-Register table. A test case has been added for this particular bug. * Also, if starting from the profile (batch mode), the state machine could be initialized without undo, which could later cause problems on rubout in interactive mode. For instance, if S^EG[a] fails and you would repeatedly type `]`, the Q-Reg name could grow indefinitely. There were probably other issues as well. Even crashes should have been possible, although I couldn't reproduce them. * Since the state machine is required only for the pattern to regexp translation and is performed anew for every character in interactive mode, we now create a fresh state machine for every call and don't attempt any undo. There might be more efficient ways, like reusing the string building's Q-Reg parser state machine.