23 Commits

Author SHA1 Message Date
Aleksandras Kostarevas
38055fae65 Add other workaround 2024-03-13 16:11:52 -05:00
Aleksandras Kostarevas
350b8e8fcf Add bad word filtering and blacklisting 2024-03-13 13:31:51 -05:00
Aleksandras Kostarevas
9fed68c03a Fix segfault when no results / only 1 result 2024-03-07 14:56:21 +02:00
Aleksandras Kostarevas
c57a3d83af Add personal dictionary glossary for voice input and keyboard 2024-03-05 15:24:30 +02:00
Aleksandras Kostarevas
6453c15a21 Merge branch 'lm-2-finetuning-whisperggml' into 'model-metadata'
Add autocorrect threshold to model-metadata branch

See merge request alex/latinime!6
2024-02-03 15:18:27 +00:00
Aleksandras Kostarevas
c7113297fb Add radio selection for threshold 2024-02-01 21:55:56 +02:00
Aleksandras Kostarevas
a111164bb8 Improve algorithm in a few ways:
* If the first letter is capital, only capitalized first tokens will be sampled. If the whole text is capitalized, then only fully capital tokens will be sampled for the whole word
* If a word is an exact match, it gets boosted relative to others
* Probability threshold for autocorrect is now 18.0
* Add "clueless" threshold, if it's less than 1.3 then just show the user's typed word in the middle instead.
2024-01-30 20:30:44 +02:00
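The rules listed in commit a111164bb8 can be sketched roughly as follows. This is an illustrative reading of the commit message, not the repository's code. The values 18.0 and 1.3 come from the message; everything else is an assumption: the function names, the EXACT_MATCH_BOOST value, the choice to compare both thresholds against the ratio of the top score to the runner-up score, and treating the first list position as the "middle" suggestion slot.

```python
AUTOCORRECT_THRESHOLD = 18.0  # from the commit message
CLUELESS_THRESHOLD = 1.3      # from the commit message
EXACT_MATCH_BOOST = 2.0       # hypothetical: the commit gives no number


def capitalization_mode(typed: str) -> str:
    """Classify the typed word as 'all_caps', 'first_cap', or 'none'."""
    letters = [c for c in typed if c.isalpha()]
    if len(letters) > 1 and all(c.isupper() for c in letters):
        return "all_caps"
    if typed[:1].isupper():
        return "first_cap"
    return "none"


def token_allowed(token: str, mode: str, is_first_token: bool) -> bool:
    """Decide whether a token may be sampled under the capitalization rule."""
    letters = [c for c in token if c.isalpha()]
    if not letters:
        return True  # punctuation-only tokens are not restricted here
    if mode == "all_caps":
        return all(c.isupper() for c in letters)
    if mode == "first_cap" and is_first_token:
        return letters[0].isupper()
    return True


def rank(typed: str, candidates: dict[str, float]) -> tuple[list[str], bool]:
    """Return (ordered suggestions, should_autocorrect).

    `candidates` maps each word to a positive score. Treating the
    thresholds as top/runner-up ratios is an assumption of this sketch.
    """
    # An exact match is boosted relative to the other candidates.
    scored = {
        word: score * (EXACT_MATCH_BOOST if word == typed else 1.0)
        for word, score in candidates.items()
    }
    ordered = sorted(scored, key=scored.get, reverse=True)
    if len(ordered) < 2:
        return ordered, False

    top, runner_up = scored[ordered[0]], scored[ordered[1]]
    ratio = top / runner_up if runner_up > 0 else float("inf")

    if ratio < CLUELESS_THRESHOLD:
        # No clear winner: show the typed word in the primary slot.
        return [typed] + [w for w in ordered if w != typed], False

    should_autocorrect = ratio >= AUTOCORRECT_THRESHOLD and ordered[0] != typed
    return ordered, should_autocorrect
```

With these assumptions, rank("teh", {"the": 90.0, "ten": 2.0}) keeps "the" first and flags it for autocorrect, while near-tied scores such as {"the": 5.0, "ten": 4.5} fall under the clueless threshold and put the typed word first.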
Aleksandras Kostarevas
0021b6aa04 Model metadata and manager component 2024-01-24 01:03:16 +02:00
Aleksandras Kostarevas
5e0722c984 Fix issue with apostrophe token being banned 2024-01-22 08:20:55 +02:00
Aleksandras Kostarevas
55d5959f54 Skip non-alphabetic characters during mixing 2024-01-09 18:25:14 +02:00
Aleksandras Kostarevas
ebb70b9c12 Fix build, disable gesture input pending model update 2023-12-19 20:28:58 +02:00
Aleksandras Kostarevas
4e9e86d871 Implement multimodal position encoder 2023-12-19 20:02:20 +02:00
Aleksandras Kostarevas
7075c22179 Add key embedding mixing 2023-12-04 20:09:51 +00:00
Aleksandras Kostarevas
4f15ff4a73 Add experimental swipe typing 2023-11-28 17:01:58 +00:00
Aleksandras Kostarevas
14fcb55565 Save LoRA-merged model after training 2023-11-14 20:40:00 +02:00
Aleksandras Kostarevas
0e0876f06c Revise training 2023-11-13 16:42:01 +02:00
Aleksandras Kostarevas
ee8a81f12c Initial fine-tuning 2023-11-07 16:48:48 +02:00
Aleksandras Kostarevas
5778cd15a0 Update ggml and llama.cpp 2023-11-06 13:41:25 +02:00
Aleksandras Kostarevas
7c4531e32d Fix crashes related to too large context 2023-10-16 18:24:00 +03:00
Aleksandras Kostarevas
92480fd460 Adjust space probability and mustNotAutocorrect 2023-10-13 18:44:38 +03:00
Aleksandras Kostarevas
c34a411989 Fix infinite prediction loop 2023-10-13 18:34:49 +03:00
Aleksandras Kostarevas
b8539ce88a Initial batched inference using llama_batch 2023-10-10 22:34:04 +03:00
Aleksandras Kostarevas
16fdb3629d Add LanguageModel class 2023-09-28 19:42:29 +03:00