Commit Graph

1502 Commits

Author SHA1 Message Date
Aleksandras Kostarevas
26aed21aff Add fallback for empty mixes 2024-08-05 14:45:16 +03:00
Aleksandras Kostarevas
c308504a25 Update tablet layouts, fix tablet swipe typing 2024-05-27 21:44:06 +03:00
Aleksandras Kostarevas
dcce3ea5ae Skip token mixes if total sum is 0
This case seems to actually be quite common during fuzzing
2024-05-16 18:37:43 -05:00
Aleksandras Kostarevas
be5ed15220 Fix some linter warnings 2024-05-16 17:18:08 -05:00
Aleksandras Kostarevas
0b9f1ca074 Update traversal 2024-05-16 15:09:51 -05:00
Aleksandras Kostarevas
b59aa89363 Use more C++ style memory management 2024-05-16 14:33:02 -05:00
Aleksandras Kostarevas
e19de589f1 Skip invalid token mixes 2024-05-16 12:18:13 -05:00
Aleksandras Kostarevas
edbb92114f Ensure skipProbability is in range 2024-05-16 12:07:28 -05:00
Aleksandras Kostarevas
09a6a30d8b Use jstring2string for more strings 2024-05-16 12:07:16 -05:00
Aleksandras Kostarevas
99d5fda170 Fix incorrect LM results with number row 2024-05-07 11:31:15 -05:00
Aleksandras Kostarevas
0e1a338f0d Re-enable is_bugged check for now 2024-04-30 14:14:41 -04:00
Aleksandras Kostarevas
57cb64f8bd Remove some logging 2024-04-30 13:58:46 -04:00
Aleksandras Kostarevas
85de4c86d4 Disable is_bugged check 2024-04-29 22:19:49 -04:00
Aleksandras Kostarevas
0b1ad01f1a LM rescoring WIP 2024-04-28 21:55:32 -04:00
Aleksandras Kostarevas
7d5b12feaf Add payment menu 2024-04-28 21:52:42 -04:00
Aleksandras Kostarevas
f261de9fab Revert n_threads and n_threads_batch change 2024-04-23 15:21:53 -04:00
Aleksandras Kostarevas
46daec4972 Split by n_batch for llama_decode 2024-04-22 14:37:14 -04:00
Aleksandras Kostarevas
ed5dbe1f2c Update weighting and max limit 2024-04-22 10:49:07 -04:00
Aleksandras Kostarevas
a0cb585ef6 Better handle intentional omissions for swipe 2024-04-22 10:05:40 -04:00
Aleksandras Kostarevas
9e0dc28848 Add memory debug action window 2024-04-22 00:32:36 -04:00
Aleksandras Kostarevas
8ae3263822 Implement initial swipe typing 2024-04-18 10:29:10 -05:00
Aleksandras Kostarevas
9308bcbfb0 Reduce logging 2024-04-11 00:43:56 -05:00
Aleksandras Kostarevas
cbd75f9799 Fix some race conditions and properly free language model 2024-04-09 23:06:31 -05:00
Aleksandras Kostarevas
eb97ee3741 Enable logging in builds 2024-04-09 14:45:13 -05:00
Aleksandras Kostarevas
63016fd067 Add more logging to whispercpp 2024-04-09 14:41:32 -05:00
Aleksandras Kostarevas
d379cb103b Fix potential crash in transformer_context_fastforward 2024-04-09 13:26:12 -05:00
Aleksandras Kostarevas
bec40d167c Fix log assertions ggml 2024-03-30 19:11:01 -05:00
Aleksandras Kostarevas
1331b1c6d6 Fix log assertions ggml 2024-03-30 19:02:47 -05:00
Aleksandras Kostarevas
e75eff11e1 Log assertions ggml 2024-03-30 18:58:07 -05:00
Aleksandras Kostarevas
5ecdafd60b Ensure conv graph measurement never undermeasures memory with dynamic audio ctx 2024-03-30 18:45:54 -05:00
Aleksandras Kostarevas
601d6df6b3 Cancel native inference early 2024-03-21 18:32:54 -05:00
Aleksandras Kostarevas
434a751d63 Fix modified utf-8 errors when returning strings 2024-03-21 16:49:45 -05:00
Aleksandras Kostarevas
c101772317 Add suppressNonSpeechTokens 2024-03-18 16:24:16 -05:00
Aleksandras Kostarevas
38055fae65 Add other workaround 2024-03-13 16:11:52 -05:00
Aleksandras Kostarevas
350b8e8fcf Add bad word filtering and blacklisting 2024-03-13 13:31:51 -05:00
Aleksandras Kostarevas
9fed68c03a Fix segfault when no results / only 1 result 2024-03-07 14:56:21 +02:00
Aleksandras Kostarevas
69649256f8 Fix comma key to open settings menu 2024-03-05 15:54:18 +02:00
Aleksandras Kostarevas
c57a3d83af Add personal dictionary glossary for voice input and keyboard 2024-03-05 15:24:30 +02:00
Aleksandras Kostarevas
42ac255a81 Sync whisper.cpp changes from voice input 2024-03-05 11:06:24 +02:00
Aleksandras Kostarevas
e4d41567b0 Update hyperparameter 2024-02-20 21:04:07 +02:00
Aleksandras Kostarevas
6453c15a21 Merge branch 'lm-2-finetuning-whisperggml' into 'model-metadata'
Add autocorrect threshold to model-metadata branch

See merge request alex/latinime!6
2024-02-03 15:18:27 +00:00
Aleksandras Kostarevas
c7113297fb Add radio selection for threshold 2024-02-01 21:55:56 +02:00
Aleksandras Kostarevas
a111164bb8 Improve algorithm in a few ways:
* If the first letter is capital, only capitalized first tokens will be sampled. If the whole text is capitalized, then only fully capital tokens will be sampled for the whole word
* If a word is an exact match, it gets boosted relative to others
* Probability threshold for autocorrect is now 18.0
* Add "clueless" threshold, if it's less than 1.3 then just show the user's typed word in the middle instead.
2024-01-30 20:30:44 +02:00
Aleksandras Kostarevas
f888ba3353 Fix finetuning, add a finetuning screen, handle errors during importing model, update metadata format, add model exporting 2024-01-30 17:14:02 +02:00
Aleksandras Kostarevas
5bf4492634 Fix embedded tokenizer loading, implement new model management methods, implement model info loading, model importing 2024-01-28 22:40:39 +02:00
Aleksandras Kostarevas
0021b6aa04 Model metadata and manager component 2024-01-24 01:03:16 +02:00
Aleksandras Kostarevas
7aea41eede Add decomposeTapPosition fallback for nonintersecting cases 2024-01-22 08:21:27 +02:00
Aleksandras Kostarevas
5e0722c984 Fix issue with apostrophe token being banned 2024-01-22 08:20:55 +02:00
Aleksandras Kostarevas
dbad61d2e6 Fix non-English dictionary prediction 2024-01-16 21:02:55 +02:00
Aleksandras Kostarevas
55d5959f54 Skip non-alphabetic characters during mixing 2024-01-09 18:25:14 +02:00
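
Commit a111164bb8 above lists several rule changes to the autocorrect algorithm. The sketch below illustrates two of them: the capitalization constraint on sampled tokens, and the two thresholds (18.0 for autocorrect, 1.3 for "clueless"). It is not the repository's code. The function names, the candidate representation, and in particular the use of the ratio between the best and second-best scores as the quantity compared against the thresholds are all assumptions made for illustration; the commit message does not say what value the thresholds apply to. Treating a one-letter word as not "fully capitalized" is also an assumption.

```python
from typing import List, Tuple

AUTOCORRECT_THRESHOLD = 18.0  # value taken from the commit message
CLUELESS_THRESHOLD = 1.3      # value taken from the commit message


def allowed_by_case(typed: str, token: str, is_first_token: bool) -> bool:
    """Hypothetical helper: may `token` be sampled given how `typed` is capitalized?"""
    letters = [c for c in typed if c.isalpha()]
    if len(letters) > 1 and all(c.isupper() for c in letters):
        # Whole word typed in capitals: only fully capital tokens, for every token.
        return token.isupper()
    if is_first_token and typed[:1].isupper():
        # Only the first letter is capital: the first token must be capitalized.
        return token[:1].isupper()
    return True


def decide(typed: str, candidates: List[Tuple[str, float]]) -> Tuple[str, bool]:
    """Hypothetical helper: return (word for the middle slot, autocorrect to it?).

    `candidates` holds (word, score) pairs with positive scores, best first.
    ASSUMPTION: confidence is the ratio of the best score to the runner-up.
    """
    if not candidates:
        return typed, False
    best_word, best_score = candidates[0]
    if len(candidates) == 1:
        # No runner-up to compare against; this sketch shows the candidate
        # but does not autocorrect to it.
        return best_word, False
    second_score = candidates[1][1]
    ratio = best_score / second_score if second_score > 0 else float("inf")
    if ratio < CLUELESS_THRESHOLD:
        # "Clueless": show the user's typed word in the middle instead.
        return typed, False
    return best_word, ratio >= AUTOCORRECT_THRESHOLD
```

For example, under these assumptions `decide("teh", [("the", 40.0), ("ten", 2.0)])` has a ratio of 20 and would autocorrect, while scores of 2.4 and 2.0 give a ratio of 1.2 and the typed word is kept. The exact-match boost from the same commit is left out because the message gives no factor for it.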