Keisuke Kuroyanagi
88bc312ad3
Move dictionary code to top level dictionary dir.
...
Bug: 18725954
Change-Id: Ia442ba4b5d84311057d83edf6e7aeb151d6a820b
2014-12-17 16:02:09 +09:00
Keisuke Kuroyanagi
0bb038a19d
Remove needless include.
...
Bug: 18725954
Change-Id: I3c823fda1b7daf41d82b118d9495f5f2356a1a5e
2014-12-15 18:54:42 +09:00
Keisuke Kuroyanagi
5e1b225082
Add missing error check in TrieMap.
...
Bug: 18725954
Change-Id: I8fcb0d15dda3f823a4575fe707bcdec57ff5e89b
2014-12-15 18:53:22 +09:00
Keisuke Kuroyanagi
ad546afbaa
Remove dependency on jni.h from WordProperty.
...
Bug: 18725954
Change-Id: Ic97d3a56b036ff042322c9f794504064046fd7d7
2014-12-15 18:09:45 +09:00
Keisuke Kuroyanagi
52e92f812b
Merge "Quit auto-correct explicit accented letters to base letters."
2014-12-09 10:24:56 +00:00
Keisuke Kuroyanagi
515c508135
Quit auto-correct explicit accented letters to base letters.
...
Bug: 7677193
Change-Id: I66eddbf27a9db8682c0347a1be19922792a3bea7
2014-12-09 19:23:27 +09:00
Keisuke Kuroyanagi
60021bbdc2
Enable Quadgram for personalized dicts.
...
Before:
Total words: 1134659, Success Num: 944709, Success Percentage: 83.259%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1258, Bad Failure Percentage: 0.111%
Failures, with auto-correction (F-C): 28013, F-C Percentage: 2.469%
Max Keystrokes: 6072844, Min Keystrokes: 3347332, Keystroke Saving Percentage:44.880%
After:
Total words: 1134665, Success Num: 945026, Success Percentage: 83.287%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1271, Bad Failure Percentage: 0.112%
Failures, with auto-correction (F-C): 27756, F-C Percentage: 2.446%
Max Keystrokes: 6072850, Min Keystrokes: 3290996, Keystroke Saving Percentage:45.808%
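The percentages in these reports appear to follow directly from the raw counts: the success percentage matches Success Num / Total words, and the keystroke saving percentage matches (Max Keystrokes - Min Keystrokes) / Max Keystrokes. A minimal sketch under that assumption follows. The function names are hypothetical and the formulas are inferred from the figures above, not taken from the regression test tool.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helpers. The formulas are inferred from the reported figures,
// not copied from the regression test tool.

// Share of words for which the expected word was produced, in percent.
double successPercentage(long long successNum, long long totalWords) {
    return 100.0 * static_cast<double>(successNum) / static_cast<double>(totalWords);
}

// Share of keystrokes saved relative to the maximum keystroke count, in percent.
double keystrokeSavingPercentage(long long maxKeystrokes, long long minKeystrokes) {
    return 100.0 * static_cast<double>(maxKeystrokes - minKeystrokes)
            / static_cast<double>(maxKeystrokes);
}
```

With the "Before" figures, keystrokeSavingPercentage(6072844, 3347332) is about 44.880, and with the "After" figures keystrokeSavingPercentage(6072850, 3290996) is about 45.808, which agrees with the report.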
Change-Id: I16af52a3e9c371b95fd6f0741f45ee6b2443bea6
2014-11-25 19:07:13 +09:00
Keisuke Kuroyanagi
78212a6d3d
Use enum to specify ngram type.
...
Change-Id: Ie28768ceadcd7a2d940c57eb30be7d4c364e509f
2014-11-25 19:07:10 +09:00
Jean Chalard
2a3ed8c988
Fix the base character of D with stroke
...
Bug: 18436480
Change-Id: Ic92cae7c85c07c8f62a5b6e69d2b71e204aff50d
2014-11-19 17:26:00 +09:00
Ken Wakasa
fc81196741
Revert "Follow up to https://android-review.googlesource.com/114561 "
...
This reverts commit 64d3f78ee5
per https://android-review.googlesource.com/114664
Change-Id: I2acab828d41e79847db72f8d2677c12173a323b6
2014-11-17 22:18:03 +00:00
Ken Wakasa
64d3f78ee5
Follow up to https://android-review.googlesource.com/114561
...
Looks like unbundled builds need to use --hash-style=sysv for the
sake of compatibility
Change-Id: Ia7b3a1cc3b2c91a8628551888a74925926dff855
2014-11-17 18:29:13 +09:00
Keisuke Kuroyanagi
580420d21b
Implement IntArrayView::split for dicttoolkit.
...
Bug: 10059681
Change-Id: Ic29e79d049bb532727cf5cb1e529fec5d35156ed
2014-11-11 15:06:48 +09:00
Keisuke Kuroyanagi
b23f03488f
Merge "Use reference instead of pointer for WordProperty()."
2014-11-10 18:32:24 +00:00
Keisuke Kuroyanagi
7d5420aa5e
Make profiler use getTimeInMicroSec().
...
Bug: 17797064
Change-Id: Ie992c9454edfc3bf93d5ea367c3a4427b513a205
2014-11-11 01:38:49 +09:00
Keisuke Kuroyanagi
bbf0d4141b
Use reference instead of pointer for WordProperty().
...
Change-Id: Idf03e97661d64186c752e35964d641a5528be5b1
2014-11-10 09:15:11 +09:00
Keisuke Kuroyanagi
2cf5550749
Fix: BoS prediction after inputting just once.
...
Change-Id: Ib69569ab6b6edfcc8c1d2c621b95de4127789ab6
2014-11-01 17:58:22 +09:00
Keisuke Kuroyanagi
b3bae2e89b
Merge "Update v4 format version from 402 to 403."
2014-10-31 14:19:44 +00:00
Keisuke Kuroyanagi
ef931546a0
Merge "Add hacks for better handling count value during migration."
2014-10-31 13:53:57 +00:00
Keisuke Kuroyanagi
a88c9682fc
Merge "Change v403 historical info format."
2014-10-31 13:38:38 +00:00
Keisuke Kuroyanagi
ea468cc9de
Update v4 format version from 402 to 403.
...
Without personalization:
Total words: 1134774, Success Num: 899230, Success Percentage: 79.243%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1871, Bad Failure Percentage: 0.165%
Failures, with auto-correction (F-C): 29084, F-C Percentage: 2.563%
Max Keystrokes: 6072959, Min Keystrokes: 4436090, Keystroke Saving Percentage:26.953%
Before:
Total words: 1134646, Success Num: 925194, Success Percentage: 81.540%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1316, Bad Failure Percentage: 0.116%
Failures, with auto-correction (F-C): 28288, F-C Percentage: 2.493%
Max Keystrokes: 6072831, Min Keystrokes: 3946188, Keystroke Saving Percentage:35.019%
After:
Total words: 1134659, Success Num: 944746, Success Percentage: 83.263%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1258, Bad Failure Percentage: 0.111%
Failures, with auto-correction (F-C): 28016, F-C Percentage: 2.469%
Max Keystrokes: 6072844, Min Keystrokes: 3387333, Keystroke Saving Percentage:44.222%
Change-Id: I3af42ec37a11847c0429c28616e726f6a339247f
2014-10-31 17:23:39 +09:00
Keisuke Kuroyanagi
c611989929
Add hacks for better handling count value during migration.
...
Bug: 14425059
Change-Id: Ib050574aa7c4babd4285322a11c3af9be9fbab1e
2014-10-31 17:22:13 +09:00
Keisuke Kuroyanagi
2383575d2d
Change v403 historical info format.
...
count -> 2B, level -> 0B.
Change-Id: I3b241126f56eb33cdf09cb1ebfed04f534e4ec48
2014-10-31 17:22:13 +09:00
Adrian Velicu
009e02ce4a
Further fixes to treat 0-frequency words
...
Previously, when both legitimate 0-frequency words (such as
distracters) and offensive words were encoded in the same
way, distracters would never show up when the user blocked
offensive words (the default setting, as well as the setting
for regression tests).
When b/11031090 was fixed and a separate encoding was used
for offensive words, 0-frequency words would no longer be
blocked when they were an "exact match" (where case
mismatches and accent mismatches would be considered an
"exact match"). The exact match boosting functionality meant
that, for example, when the user typed "mt" they would be
suggested the word "Mt", although they most probably meant
to type "my".
For this reason, we introduced this change, which does the
following:
* Defines the "perfect match" as a really exact match, with
no room for case or accent mismatches
* When the target word has probability zero (as "Mt" does,
because it is a distracter), ONLY boost its score if it is a
perfect match.
By doing this, when the user types "mt", the word "Mt" will
NOT be boosted, and they will get "my". However, if the user
makes an explicit effort to type "Mt", we do boost the word
"Mt" so that the user's input is not autocorrected to "My".
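The rule described above can be sketched as follows. This is an illustration only: the type and function names are hypothetical, and the "exact match" check here folds ASCII case only, whereas the real definition also tolerates accent mismatches.

```cpp
#include <cassert>
#include <string>

// Hypothetical names, for illustration only.
enum class MatchType { NONE, EXACT, PERFECT };

// Simplification: lower-cases ASCII letters only. Accent folding is omitted.
static std::string toLowerAscii(std::string s) {
    for (char &c : s) {
        if (c >= 'A' && c <= 'Z') {
            c = static_cast<char>(c - 'A' + 'a');
        }
    }
    return s;
}

// PERFECT: identical strings. EXACT: equal after case folding. NONE: otherwise.
MatchType classifyMatch(const std::string &typed, const std::string &candidate) {
    if (typed == candidate) return MatchType::PERFECT;
    if (toLowerAscii(typed) == toLowerAscii(candidate)) return MatchType::EXACT;
    return MatchType::NONE;
}

// Zero-probability words (such as distracters) are boosted only on a perfect
// match. Other words keep the exact match boost.
bool shouldBoost(const std::string &typed, const std::string &candidate,
        int probability) {
    const MatchType match = classifyMatch(typed, candidate);
    if (probability == 0) return match == MatchType::PERFECT;
    return match != MatchType::NONE;
}
```

Under these assumptions, typing "mt" does not boost the zero-probability word "Mt", while typing "Mt" does.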
Bug: 11031090
Change-Id: I92ee1b4e742645d52e2f7f8c4390920481e8fff0
2014-10-31 15:58:50 +09:00
Adrian Velicu
10416241f7
Block offensive words in multi-word suggestions
...
If the user has chosen to block offensive words and types
"aaaxbb", where "aaa" is an offensive word and "bb" is not,
we should not suggest "aaa bb".
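A minimal sketch of this check, assuming the multi-word candidate is already split into words and that possibly offensive words can be looked up in a set. Both the function name and the set-based lookup are hypothetical; the actual change works on dictionary flags.

```cpp
#include <cassert>
#include <string>
#include <unordered_set>
#include <vector>

// Hypothetical helper: returns true when a multi-word suggestion may be shown.
// When blocking is enabled, a single possibly offensive word rejects the
// whole suggestion.
bool isMultiWordSuggestionAllowed(const std::vector<std::string> &words,
        const std::unordered_set<std::string> &possiblyOffensive,
        bool blockOffensiveWords) {
    if (!blockOffensiveWords) return true;
    for (const std::string &word : words) {
        if (possiblyOffensive.count(word) > 0) return false;
    }
    return true;
}
```

For the example above, the candidate {"aaa", "bb"} is rejected when blocking is on because "aaa" is in the set, even though "bb" is not.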
Bug: 11031090
Change-Id: Ie23b8dd5d347bc26b1c046c3f5e8dfbc259bf528
2014-10-31 15:58:50 +09:00
Adrian Velicu
aa20342d7e
Merge "Using "blacklist" flag as "possibly offensive""
2014-10-31 06:49:29 +00:00
Adrian Velicu
7c87859d4c
Using "blacklist" flag as "possibly offensive"
...
Bug: 11031090
Change-Id: I5cc0d006ab003656498eb82b0875eb9c051d331e
2014-10-31 14:33:05 +09:00
Keisuke Kuroyanagi
0cd1f222fd
Fix: native unit test build.
...
Change-Id: Id2bd4b60d6a4023815a630ebb3059a435b72c193
2014-10-31 12:50:45 +09:00
Keisuke Kuroyanagi
bcb52d73e2
Enable count based dynamic ngram language model for v403.
...
Bug: 14425059
Change-Id: Icc15e14cfd77d37cd75f75318fd0fa36f9ca7a5b
2014-10-30 23:38:19 +09:00
Keisuke Kuroyanagi
660b00477c
Add DynamicLanguageModelProbabilityUtils.
...
Bug: 14425059
Change-Id: Ia58ab3f0ead02798046d182a9464dcbd95f086bc
2014-10-30 21:33:57 +09:00
Keisuke Kuroyanagi
0a9c3f30b6
Add method to encode probability.
...
Bug: 14425059
Change-Id: I3e5d359ba5fa38f1669f0e98dfae792ff53efbf8
2014-10-30 12:42:35 +09:00
Keisuke Kuroyanagi
c2ba0ce411
Fix: TRT and ime-simulator build.
...
Change-Id: I1697a907562d1ed6aff2b001763d1594263ba0d3
2014-10-30 01:01:40 +09:00
Keisuke Kuroyanagi
afe67611c3
Merge "Add a class to have global counters for LanguageModelDictContent."
2014-10-29 12:18:12 +00:00
Keisuke Kuroyanagi
6b0561f9d2
Add a class to have global counters for LanguageModelDictContent.
...
Bug: 14425059
Change-Id: I08ec19903432356b6028853fd73b4eefce20218e
2014-10-29 21:05:41 +09:00
Keisuke Kuroyanagi
dabc12974c
Merge "Improve space substitution error correction."
2014-10-28 09:26:40 +00:00
Keisuke Kuroyanagi
8a809f3433
Improve space substitution error correction.
...
Bug: 17432052
[Category diff]
+1 262
-1 93
+2 2
-2 18
+3 18
-3 2
+4 111
-4 148
+5 295
-5 217
+6 51
-6 276
+7 139
-7 124
[Weighted category diff]
+1 276
-1 100
+2 4
-2 20
+3 20
-3 4
+4 118
-4 160
+5 309
-5 225
+6 52
-6 298
+7 163
-7 135
show diff for ./en_user_log_phones_2011_08.csv
+1 173
-1 28
+2 2
-2 17
+3 17
-3 2
+4 63
-4 82
+5 120
-5 51
+6 24
-6 220
+7 88
-7 87
Change-Id: I9d673acb0ff632828ae2e0ead56e76e3a20411c6
2014-10-28 17:11:14 +09:00
Keisuke Kuroyanagi
3844f74aff
Fix: deleted PtNode handling in v403.
...
Once a word has been deleted, it can never get back into the
personalized dictionaries because of this bug.
Change-Id: Ife4e3fe1ba0615b4135e6291d2151b0db7d3f940
2014-10-27 15:32:05 +09:00
Yohei Yukawa
69402dc992
Merge "Enable Address Sanitizer for native host test 2nd try"
2014-10-23 16:07:39 +00:00
Keisuke Kuroyanagi
e65973882d
Merge "Fix: Personalized dicts suggest invalid words with v403."
2014-10-23 10:33:09 +00:00
Keisuke Kuroyanagi
090c3819d7
Fix: Personalized dicts suggest invalid words with v403.
...
Bug: 14425059
Change-Id: I45ae00069dd3b7c461dd9a1f3558b96af0a1c975
2014-10-23 19:26:01 +09:00
Yohei Yukawa
5c4bec31d1
Enable Address Sanitizer for native host test 2nd try
...
This CL enables Address Sanitizer for the native host test. Note that
the production build is not affected by this change. ASan is enabled
only in the static lib for test executables.
Change-Id: I2c8e99b8c55e611e86f74579f24a63ac949bb02d
2014-10-23 10:16:55 +00:00
Yohei Yukawa
2db1e56ff4
Merge "Stop building host native test in unbundled build"
2014-10-23 09:39:56 +00:00
Yohei Yukawa
ba35bb83a8
Stop building host native test in unbundled build
...
It turned out that building native code for host environment
is not supported in NDK build. Hence this CL makes the host
native test available only as a part of platform build to
avoid accidental build breakage in unbundled build.
BUG: 18095678
Change-Id: If608da166d5a478358e6890b8db526b4c2c0ab41
2014-10-23 18:31:06 +09:00
Keisuke Kuroyanagi
16cc3992d7
Use trigrams for personalization dict.
...
Bug: 14425059
Change-Id: I73cf6904e569d60996a3b079f16ea6df0cb90f02
2014-10-23 14:32:45 +09:00
Yohei Yukawa
9c0b3419da
Merge "Revert "Enable ASan (Address Sanitizer) for native host test""
2014-10-22 10:54:18 +00:00
Yohei Yukawa
b9dc32ffd5
Revert "Enable ASan (Address Sanitizer) for native host test"
...
This reverts commit af2673f17d
because of build failure in tapas build.
Change-Id: Ib02931116181c98b35ce938e42d2376225e9b255
2014-10-22 10:51:33 +00:00
Yohei Yukawa
0672e8554f
Merge "Enable ASan (Address Sanitizer) for native host test"
2014-10-22 10:13:08 +00:00
Yohei Yukawa
af2673f17d
Enable ASan (Address Sanitizer) for native host test
...
This CL enables Address Sanitizer for the native host test. Note that
the production build is not affected by this change. ASan is enabled
only in the static lib for test executables.
Change-Id: Idbe1f2e4502dfce9b6fb0253d7ebda8d37fbf84e
2014-10-22 19:08:58 +09:00
Keisuke Kuroyanagi
b5ef884fbb
Support dumping ngram entries.
...
Bug: 14425059
Change-Id: Ib03a0c3d166ed6f1e60c67127b28006d55143b6b
2014-10-22 18:15:53 +09:00
Keisuke Kuroyanagi
c9865785f4
Support ngram entry migration.
...
Bug: 14425059
Change-Id: I98cb9fa303af2d93a0a3512e8732231c564e3c5d
2014-10-22 11:31:16 +09:00
Keisuke Kuroyanagi
0b8bb0c21b
Fix debug build.
...
Change-Id: Id94636714d04a8828718b87741c0ee62a14cb3b4
2014-10-21 20:20:11 +09:00
Keisuke Kuroyanagi
dfc82fa366
Merge changes I210acb81,Ie9508788
...
* changes:
Make NgramProperty have NgramContext.
Create .cpp file for NgramContext.
2014-10-21 10:28:25 +00:00
Keisuke Kuroyanagi
88bb28c132
Make NgramProperty have NgramContext.
...
Bug: 14425059
Change-Id: I210acb816b122857dbbe1ee4dd6a35c5335bf2bf
2014-10-21 17:12:32 +09:00
Keisuke Kuroyanagi
f87bb77a91
Create .cpp file for NgramContext.
...
Bug: 14425059
Change-Id: Ie950878817b9c80cc9c970e1a84880c9b9ab228a
2014-10-21 17:04:56 +09:00
Keisuke Kuroyanagi
fa1e65cb3a
Merge "Use EntryCounters during GC."
2014-10-21 07:55:04 +00:00
Adrian Velicu
c51b9b5b3f
Merge "Renaming "blacklist" flag to "possibly offensive""
2014-10-21 07:39:18 +00:00
Keisuke Kuroyanagi
47fc656cd7
Use EntryCounters during GC.
...
Bug: 14425059
Change-Id: I61eb798686dc753fb6c0fe99a0719c1732198f30
2014-10-21 16:36:03 +09:00
Keisuke Kuroyanagi
e8750d970e
Introduce EntryCounters to count entries in a dictionary.
...
Bug: 14425059
Change-Id: Ic13ba827d96fa4a147485ba92fdb37e23e04e8e8
2014-10-21 15:46:14 +09:00
Adrian Velicu
05172bf1a5
Renaming "blacklist" flag to "possibly offensive"
...
No behaviour changes.
Unified the overloaded FusionDictionary::add method to always take an
isPossiblyOffensive argument.
Bug: 11031090
Change-Id: I5741a023ca1ce842d2cf10d4f6c926b0efabaa78
2014-10-21 11:51:47 +09:00
Keisuke Kuroyanagi
1085fef8d0
Change entry count limit.
...
Unigram 10K, Bigram 30K, Trigram 30K.
Change-Id: Ibd19c6a2b618499df1c70000bad7b47498187f0a
2014-10-20 15:01:49 +09:00
Keisuke Kuroyanagi
f4928ad4dd
Merge "Update useless n-gram entry detection logic during GC."
2014-10-15 21:44:45 +00:00
Keisuke Kuroyanagi
3601c214f8
Update useless n-gram entry detection logic during GC.
...
Bug: 14425059
Change-Id: Ib939deae5b60167751dee07965bb1ef1a43c4625
2014-10-15 20:43:27 +09:00
Keisuke Kuroyanagi
183e21c36c
Merge "Use better conditional probability for ngram entries."
2014-10-15 09:27:21 +00:00
Keisuke Kuroyanagi
72d17d9209
Use better conditional probability for ngram entries.
...
Old:
P(W | W_prev) = f(W, W_prev) + C
New:
P(W | W_prev) = f(W, W_prev) / f(W_prev)
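The new formula can be sketched as below, assuming f denotes an observed count (an interpretation, not stated in the message). The function name, the zero-count handling, and the clamp to 1.0 are assumptions added for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Hypothetical sketch of P(W | W_prev) = f(W, W_prev) / f(W_prev),
// treating f as a count.
// ngramCount:   f(W, W_prev), how often W followed W_prev.
// contextCount: f(W_prev), how often W_prev was seen.
double conditionalProbability(int ngramCount, int contextCount) {
    // Assumption: an unseen context or unseen n-gram yields probability 0.
    if (contextCount <= 0 || ngramCount <= 0) return 0.0;
    const double ratio =
            static_cast<double>(ngramCount) / static_cast<double>(contextCount);
    // Assumption: clamp in case the counts are temporarily inconsistent.
    return std::min(ratio, 1.0);
}
```

Unlike the old additive form, the ratio depends on how frequent the context itself is, so an n-gram seen 3 times after a context seen 12 times gets 0.25.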
Bug: 14425059
Bug: 16547409
Change-Id: I4d13be6de2c6bad6bad7fb22320a23ba4ecd361c
2014-10-15 18:23:00 +09:00
Keisuke Kuroyanagi
c2429c54ac
Merge "Move entry updating method to language model dict content."
2014-10-15 04:51:04 +00:00
Keisuke Kuroyanagi
5400701908
Move entry updating method to language model dict content.
...
Bug: 14425059
Change-Id: I710055490d141539458cbf968adf5a7ccffd9552
2014-10-15 12:29:31 +09:00
Keisuke Kuroyanagi
d8ccb9093b
Quit using weightChildNode for ADDITIONAL_PROXIMITY and SUBSTITUTION.
...
[Category diff]
+1 0
-1 1
+2 0
-2 0
+3 0
-3 0
+4 1
-4 1
+5 8
-5 7
+6 0
-6 1
+7 1
-7 0
[Weighted category diff]
+1 0
-1 1
+2 0
-2 0
+3 0
-3 0
+4 1
-4 1
+5 8
-5 7
+6 0
-6 1
+7 1
-7 0
Bug: 13756409
Change-Id: I6ac3567545676bbefbee3e87dda54bc083c15fb6
2014-10-14 20:20:55 +09:00
Keisuke Kuroyanagi
d1471ee053
Merge "Remove shouldBlockAutoCorrectionBySafetyNet"
2014-10-14 10:52:32 +00:00
Keisuke Kuroyanagi
29b4f7aa67
Remove shouldBlockAutoCorrectionBySafetyNet
...
Bug: 13756409
[Category diff]
+1 27
-1 0
+2 0
-2 0
+3 0
-3 1
+4 11
-4 0
+5 51
-5 0
+6 0
-6 38
+7 0
-7 50
[Weighted category diff]
+1 28
-1 0
+2 0
-2 0
+3 0
-3 1
+4 11
-4 0
+5 51
-5 0
+6 0
-6 39
+7 0
-7 50
show diff for ./en_user_log_phones_2011_08.csv
+1 4
+4 5
+5 7
-6 9
-7 7
The increase of false positives comes from the spaceless
typing test cases that are synthetic data.
Change-Id: I4ea77aa56ebfaa5518c71107169e1d2332de6327
2014-10-14 11:20:33 +09:00
Keisuke Kuroyanagi
d70b8ff291
Fix: BoS bigram from user history dictionary is too strong.
...
They can always be stronger than BoS predictions from the
contextual dictionary.
Bug: 17961731
Change-Id: I70297d82436c10c790bdfad6f3dfefdb4bb2f852
2014-10-13 08:52:08 +00:00
Jean Chalard
7d5e1cb265
[ML23] Introduce a different accuracy/performance tradeoff
...
Bug: 11230254
Change-Id: Ic09518c818ae7b68942b1c63160dd462e5922cb5
2014-10-10 18:02:52 +09:00
Keisuke Kuroyanagi
229f354fdc
Merge "Make members of classes that are used with std::vector const"
2014-10-10 05:39:57 +00:00
Keisuke Kuroyanagi
b559c65e7e
Merge "Rename prev_words_info.h to ngram_context.h"
2014-10-09 12:47:24 +00:00
Keisuke Kuroyanagi
10fa30e380
Merge "Rename PrevWordsInfo to NgramContext."
2014-10-09 12:47:17 +00:00
Keisuke Kuroyanagi
620ebde704
Make members of classes that are used with std::vector const
...
Change-Id: Id93fb87f5630230fc3f9cd339e12f3b0e2006ea9
2014-10-09 21:28:40 +09:00
Keisuke Kuroyanagi
45783013bf
Rename prev_words_info.h to ngram_context.h
...
Bug: 14425059
Change-Id: I0e906631ecad2361a8198b3f9e3394bb22c5bf83
2014-10-09 21:28:19 +09:00
Adrian Velicu
44efbe64b1
Fixing misspelled word
...
Change-Id: I51d77e271143d40256b39e5c60a3065d9fdf63fb
2014-10-09 19:26:54 +09:00
Keisuke Kuroyanagi
72e2383d11
Rename PrevWordsInfo to NgramContext.
...
Bug: 14425059
Change-Id: I30703fc80e9450d4e2dbfec965e7f9f4468f6a11
2014-10-09 17:34:32 +09:00
Keisuke Kuroyanagi
ab4437f468
Rename updateCounter to updateEntriesForWordWithNgramContext.
...
Bug: 14425059
Change-Id: Id9b0dd7e32c711ed4292981517c3febd5fe9e897
2014-10-09 17:34:29 +09:00
Keisuke Kuroyanagi
d7a51c242b
Extend jni method to dump ngram entries.
...
Bug: 14425059
Change-Id: Ib03b58525fae12f254b45630bee31ecbdde227b1
2014-10-09 15:26:10 +09:00
Keisuke Kuroyanagi
948ef10d03
Merge "Improve bigram probability computation for decaying dicts."
2014-10-06 13:06:29 +00:00
Keisuke Kuroyanagi
aae1a062eb
Improve bigram probability computation for decaying dicts.
...
Without personalization:
Total words: 1079345, Success Num: 819749, Success Percentage: 75.949%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1754, Bad Failure Percentage: 0.163%
Failures, with auto-correction (F-C): 28463, F-C Percentage: 2.637%
Max Keystrokes: 6074285, Min Keystrokes: 4649326, Keystroke Saving Percentage:23.459%
With current probability computing logic:
Total words: 1079382, Success Num: 838329, Success Percentage: 77.667%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1332, Bad Failure Percentage: 0.123%
Failures, with auto-correction (F-C): 28558, F-C Percentage: 2.646%
Max Keystrokes: 6074503, Min Keystrokes: 4474102, Keystroke Saving Percentage:26.346%
Remove isof files.
With new probability computing logic:
Total words: 1079356, Success Num: 844954, Success Percentage: 78.283%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1306, Bad Failure Percentage: 0.121%
Failures, with auto-correction (F-C): 27214, F-C Percentage: 2.521%
Max Keystrokes: 6074477, Min Keystrokes: 4243021, Keystroke Saving Percentage:30.150%
Remove isof files.
Bug: 16547409
Change-Id: I3d2a49c7aaa2c0f6835c52ef72d22466ee225789
2014-10-06 22:03:11 +09:00
Keisuke Kuroyanagi
552470c882
Merge "Make sure to suppress BoS prediction until input twice."
2014-10-06 10:46:31 +00:00
Keisuke Kuroyanagi
c7d199e770
Merge "Fix: BoS prediction is shown after inputting just once."
2014-10-06 10:38:18 +00:00
Keisuke Kuroyanagi
1c2f00f6b5
Make sure to suppress BoS prediction until input twice.
...
Change-Id: I98d91f264d5d1d3f5bcda1fd9ec885779ba2f746
2014-10-06 19:36:04 +09:00
Keisuke Kuroyanagi
ca17ed7d9d
Fix: BoS prediction is shown after inputting just once.
...
Change-Id: Ibba209f47cb5b1a4b08281689d607711b8dcfad4
2014-10-06 19:28:39 +09:00
Keisuke Kuroyanagi
36c4eaadfb
Show prediction results in debug build.
...
Bug: 16547409
Change-Id: If85418583998cd639c794bf5d5cfbbb972c34f72
2014-10-06 18:36:54 +09:00
Keisuke Kuroyanagi
16e1615301
Fix: ProbabilityDictContent can be wrongly large.
...
It can be twice as large as it should be (80KB larger).
Change-Id: If94f748f8c48a442b3c95ac989099aaed2aa2f86
2014-10-06 11:55:07 +09:00
Keisuke Kuroyanagi
3d7681d8e5
Merge "Remove -fno-inline from profiling build."
2014-10-03 09:35:14 +00:00
Jean Chalard
4ef27c0358
[ML13] Fix the locale passing in ProximityInfo
...
The locale is used to determine additional proximity characters. This
is dependent on the dictionary language, but was passed as a function
of the layout, which is wrong and would have given bad suggestions in
multi-lingual mode.
Ideally, additional proximity characters should be inserted in the
dictionary header, but for now it's a rather simple change to get
it from the dictionary's locale instead of the proximity info locale.
Also, that allows us to remove the locale parameter from proximity
info completely, which is a much needed change.
This change has zero effect on unit tests and on regression tests.
Bug: 11230254
Change-Id: If95157155db7dccd1f00b8ba55ccb3600283f9e4
2014-10-03 18:16:34 +09:00
Keisuke Kuroyanagi
3da610d5a6
Remove -fno-inline from profiling build.
...
Bug: 17797064
Change-Id: I0ab60e2da8c9f509de472afbc3ad32e41e69e73c
2014-10-03 18:16:19 +09:00
Keisuke Kuroyanagi
29777e3a8a
Implement updateCounter() by using existing entry adding methods.
...
Bug: 14425059
Change-Id: I0b6cb80e1fb8f738e9c7d9e80fbc0c479546b879
2014-10-01 19:59:39 +09:00
Keisuke Kuroyanagi
ce5fd94b97
Add jni method to update entries for a word efficiently.
...
Bug: 14425059
Change-Id: Ic628939ea68cdee17573409883a8403c73b78905
2014-10-01 17:40:20 +09:00
Keisuke Kuroyanagi
287e155e44
Move HistoricalInfo to property and use it in *Property.
...
Bug: 14425059
Change-Id: Icccccabad98fb543c6a6be2844cfc0086d80b739
2014-10-01 11:39:33 +09:00
Keisuke Kuroyanagi
79bb37d499
Rename BigramProperty to NgramProperty.
...
Remaining work is changing bigram to ngram for supporting
ngram entry counting, dumping, and migration.
Bug: 14425059
Change-Id: Ifba288a1166996d62a5e57698f63537ea0a2a8ee
2014-09-29 19:10:39 +09:00
Keisuke Kuroyanagi
cb4f544198
Quit reading unigram probability in Ver4PatriciaTrieNodeReader.
...
Bug: 14425059
Change-Id: I4fc7b0e236151a2c64e7131772264024c6597633
2014-09-25 11:41:50 +09:00
Keisuke Kuroyanagi
2842e50c4b
Use std::move for dictionary properties.
...
Change-Id: I15056b36b7493f4bac1dbcbb46a0b44343ede153
2014-09-25 11:36:52 +09:00
Keisuke Kuroyanagi
ed77b37e6b
Merge "Use CodePointArrayView in WordProperty."
2014-09-24 07:38:37 +00:00
Keisuke Kuroyanagi
7313b0debe
Merge "Refactoring method to get code points and code point count."
2014-09-24 07:24:48 +00:00
Keisuke Kuroyanagi
bd0ac97917
Fix: native unit test.
...
Bug: 14425059
Change-Id: I7c5103eac35e01a5732f1f4594d730df06dfca96
2014-09-24 15:26:26 +09:00
Keisuke Kuroyanagi
80d139a694
Use CodePointArrayView in WordProperty.
...
Change-Id: I45a9755c413003831788d190beb499fee8ce63aa
2014-09-24 14:15:36 +09:00
Keisuke Kuroyanagi
65a7ccfa00
Refactoring method to get code points and code point count.
...
Bug: 14425059
Change-Id: I4731bd6076d34556e46e6714180fed324fb6aba3
2014-09-24 14:15:36 +09:00
Keisuke Kuroyanagi
7d911d6f91
Move word flags to language model dict content.
...
Bug: 14425059
Change-Id: I64712e5c83d0bc241e6f0f16117ab47b5d75bd4b
2014-09-24 14:15:34 +09:00
Keisuke Kuroyanagi
ddfaeff544
Prepare supporting n-gram for user history dictionary.
...
Bug: 17097992
Change-Id: Ic8bfde3d4cc0e720bf7681e08e16fb2ad94d5670
2014-09-22 18:18:50 +09:00
Keisuke Kuroyanagi
5a20827fc7
Fix native crash.
...
Change-Id: Ia8871db0248ddbc821605079e2a3e93859f4c9b7
2014-09-19 21:04:15 +09:00
Akifumi Yoshimoto
5c6db929e4
Merge "Native side reads character table"
2014-09-19 09:58:40 +00:00
Akifumi Yoshimoto
fb2bde5a68
Native side reads character table
...
Bug: 17097992
Change-Id: Ibcfc67833a6754fe6a2d82a3e3023b33886b9ea2
2014-09-19 18:00:16 +09:00
Jean Chalard
6da9b21191
[ML8] Add a language weight
...
...and rename an improperly named normalization value
Bug: 11230254
Change-Id: I0f5633148a9f66dbfd7d28540b8a8985131c4549
2014-09-19 13:44:42 +09:00
Keisuke Kuroyanagi
9db2cb4a37
Make MAX_PREV_WORD_COUNT_FOR_N_GRAM 2.
...
Bug: 14425059
Change-Id: Iefacd7e4a6c256658a58856240310eeb91ae32db
2014-09-18 19:18:55 +09:00
Keisuke Kuroyanagi
8806248386
Pass prevWordCount to native code for getSuggestion.
...
Bug: 14425059
Change-Id: Id2b49a448982c997e5c0a22445b115e675156c3e
2014-09-18 19:18:55 +09:00
Keisuke Kuroyanagi
b349b57476
Merge "Use ReadOnlyByteArrayView in Ver2ParticiaTrieNodeReader."
2014-09-17 12:53:54 +00:00
Keisuke Kuroyanagi
a533e18c07
Merge "Use ReadOnlyByteArrayView in Ver2PtNodeArrayReader."
2014-09-17 12:53:48 +00:00
Keisuke Kuroyanagi
4a55d41410
Merge "Use ReadOnlyByteArrayView in ShortcutListPolicy"
2014-09-17 12:51:06 +00:00
Keisuke Kuroyanagi
c1040a255c
Merge "Use ReadOnlyByteArrayView in BigramListPolicy."
2014-09-17 12:50:51 +00:00
Keisuke Kuroyanagi
5849feeee1
Merge "Use ReadOnlyByteArrayView in PatriciaTriePolicy."
2014-09-17 12:50:16 +00:00
Keisuke Kuroyanagi
198a47a161
Merge "Add firstOrDefault and lastOrDefault to IntArrayView."
2014-09-17 12:43:10 +00:00
Keisuke Kuroyanagi
6258c57c32
Use ReadOnlyByteArrayView in Ver2ParticiaTrieNodeReader.
...
Change-Id: Ie3325bb9666c7ff6d519a1b72ed2fe9fc4f140a9
2014-09-17 21:42:30 +09:00
Keisuke Kuroyanagi
a892628fa3
Use ReadOnlyByteArrayView in Ver2PtNodeArrayReader.
...
Change-Id: Ia1065465abdf7c64fadb3ff1efd2a4d2df343264
2014-09-17 21:42:30 +09:00
Keisuke Kuroyanagi
59ebd51718
Use ReadOnlyByteArrayView in ShortcutListPolicy
...
Change-Id: I03a6f49c9005306bcc5fce8b7e4d37b8d30b9faa
2014-09-17 21:42:29 +09:00
Keisuke Kuroyanagi
d01eb3c94c
Use ReadOnlyByteArrayView in BigramListPolicy.
...
Change-Id: I0f8e8044c8803a7a542f97be5b8f08fdb5ae3926
2014-09-17 21:42:26 +09:00
Keisuke Kuroyanagi
180e7b4c07
Use ReadOnlyByteArrayView in PatriciaTriePolicy.
...
Change-Id: I4ef01c0af73f27c256cc1a719343be836913dc71
2014-09-17 21:38:46 +09:00
Keisuke Kuroyanagi
09c154925f
Add firstOrDefault and lastOrDefault to IntArrayView.
...
Change-Id: I854c02eff3fa0b53c72a5f1cabce001f4854ada0
2014-09-17 21:16:31 +09:00
Keisuke Kuroyanagi
fc7d0540fe
Use CodePointArrayView in DictionaryUtils.
...
Change-Id: I9ae308e60124ea5acb4ee09847c4fdd58ff168e2
2014-09-17 20:13:36 +09:00
Keisuke Kuroyanagi
3e75c59133
Use CodePointArrayView in Dictionary.
...
Change-Id: I63fa0a8348f6de6ec7a424a8033e936b4af72beb
2014-09-17 20:13:36 +09:00
Keisuke Kuroyanagi
89a074fade
Use IntArrayView in DynamicPtUpdatingHelper.
...
Change-Id: Ifa185eba7dd9abee1b35e49fe559be1042ca63d7
2014-09-17 20:13:36 +09:00
Keisuke Kuroyanagi
4926b90ec5
Support n-gram for look-up.
...
Bug: 14425059
Change-Id: I19523c29fb802cd65158c7540d1608e7f55c4ca7
2014-09-17 16:20:00 +09:00
Keisuke Kuroyanagi
5520e84e16
Support removing n-gram entry in structure policy.
...
Bug: 14425059
Change-Id: Ifd2ac97a15dcbb936e2f068866a9d1ee68759ba2
2014-09-16 15:46:28 +09:00
Keisuke Kuroyanagi
0c0b8207cd
Support adding n-gram entry in structure policy.
...
Bug: 14425059
Change-Id: Ia207d3c4735b1d6d43f18d18a70d28af613cb458
2014-09-16 15:37:09 +09:00
Keisuke Kuroyanagi
7eb6e28b9e
Update input checking logic for adding/removing n-gram.
...
Bug: 14425059
Change-Id: Ifb4e3c3f06aaf9a16f632ff208ba76cf741cea83
2014-09-16 14:14:22 +09:00
Keisuke Kuroyanagi
d2230525bc
Have mPrevWordCount in DicNodeProperties.
...
Bug: 14425059
Change-Id: I5ce22bace4ec08d0da4e5c167288a742c4426c33
2014-09-16 12:46:16 +09:00
Keisuke Kuroyanagi
36ba139ca6
Support decaying dict in getWordProbability().
...
Bug: 14425059
Change-Id: I24db3f9131c2999fc388035dc365c7faaef3bdb1
2014-09-14 17:29:50 +09:00
Keisuke Kuroyanagi
c43b6664fa
Use passed previous word count in PrevWordsInfo.
...
Bug: 14425059
Change-Id: I04007bdacf0176a05be7a27ef1c20c5b851d8bed
2014-09-14 17:29:38 +09:00
Keisuke Kuroyanagi
537f6eea8a
Use WordIdArrayView for prevWordIds.
...
Bug: 14425059
Change-Id: Ia84fb997d89564e60111b46ca83bbfa3b187f316
2014-09-11 19:36:22 +09:00
Keisuke Kuroyanagi
a3b0eb1685
Implement IntArrayView.copyToBuffer().
...
Bug: 14425059
Change-Id: Ic1b7de72701eae0c0b1bb28d12514099446a66a9
2014-09-11 15:21:52 +09:00
Keisuke Kuroyanagi
d53aea5af9
Remove unigram probability from dicNode.
...
Bug: 14425059
Change-Id: Ie848e8568bb4dbb1d8358e823a881d9157a1aad3
2014-09-10 21:21:25 +09:00
Keisuke Kuroyanagi
c32356c229
Quit using dicNode.getUnigramProbability().
...
Bug: 14425059
Change-Id: I192070cc11e5d46c8413ebc19982d6a8c93577fc
2014-09-10 21:21:25 +09:00
Keisuke Kuroyanagi
521e2382da
Use CodePointArrayView to create children DicNodes.
...
Change-Id: Ie940b6595f3f3f804fbb8dd03c710ea062b75af3
2014-09-10 21:21:23 +09:00
Keisuke Kuroyanagi
87a5c76906
Use WordAttributes for checking flags.
...
Bug: 14425059
Change-Id: Idee84478a482a0e7b5cc53e5dbd4e2484584ba79
2014-09-10 19:51:57 +09:00
Keisuke Kuroyanagi
2111e3abc9
Introduce WordAttributes to get word probability and flags.
...
Bug: 14425059
Change-Id: Iee11d038e0893d7ddd6c52447907f8c55fecb6a5
2014-09-10 19:51:48 +09:00
Keisuke Kuroyanagi
11a48f92a5
Use getProbabilityOfWordInContext for prediction.
...
Bug: 14425059
Change-Id: I9d5c905a0adda3503c593bfbf0bb9af8d1686f5d
2014-09-10 19:51:14 +09:00
Keisuke Kuroyanagi
395fe8e98d
Implement LanguageModelDictContent.getWordProbability().
...
Bug: 14425059
Change-Id: I290a05cee6f341caa25fb222892505529cef1eb7
2014-09-10 19:51:12 +09:00
Keisuke Kuroyanagi
9f8da0f833
Use MultiBigramMap in structure policy.
...
Bug: 14425059
Change-Id: I4d78da4839ef177e0223e6e5bcf0ebd7315c3099
2014-09-09 17:53:44 +09:00
Keisuke Kuroyanagi
138647876b
Merge "Rename probability to unigramProbability."
2014-09-09 06:16:08 +00:00
Keisuke Kuroyanagi
72ee1c7ee5
Merge "Remove mHasChildrenPtNodes from DicNodeProperties."
2014-09-09 06:03:36 +00:00
Keisuke Kuroyanagi
3e6c5edcf9
Merge "Add FormatUtilsTest."
2014-09-09 05:24:28 +00:00
Keisuke Kuroyanagi
5d95e8d747
Add FormatUtilsTest.
...
Change-Id: I39ee1d18e0b70d9f4504d29a94cabc9354e4527e
2014-09-09 14:10:53 +09:00
Keisuke Kuroyanagi
9c42ad47d4
Rename probability to unigramProbability.
...
Bug: 14425059
Change-Id: I6a204c3b8fb257d037ad95a1a455ae6fb89068fd
2014-09-09 14:09:01 +09:00
Keisuke Kuroyanagi
d028294890
Remove mHasChildrenPtNodes from DicNodeProperties.
...
Bug: 14425059
Change-Id: I3a9511e7f7c3a722f9942f525530f04def5965da
2014-09-09 14:08:41 +09:00
Keisuke Kuroyanagi
3f779718cc
Merge "Remove DicNode.getPtNodePos()."
2014-09-05 10:15:45 +00:00
Keisuke Kuroyanagi
9ff6fee838
Remove DicNode.getPtNodePos().
...
Bug: 14425059
Change-Id: If6e291d23e68342792febb85f8a576ce785b3845
2014-09-05 17:27:01 +09:00
Keisuke Kuroyanagi
5cfa023777
Merge "Add HeaderReadWriteUtilsTest"
2014-09-04 07:30:59 +00:00