INDEX
Explanations
mentions of locations or entities with the prefix "Mo"
occurrences of the word "Mo" or related constructs
Negative Logits
Token            Logit
responsibility   -0.71
Icelandic        -0.69
Identification   -0.68
Integrity        -0.68
Prohibition      -0.66
Discrimination   -0.62
caution          -0.62
Prelude          -0.62
stood            -0.61
envy             -0.60
Positive Logits
Token   Logit
ose     1.14
oney    1.06
oby     1.00
omon    0.97
ogly    0.93
osh     0.93
ascus   0.92
oser    0.92
utes    0.91
oses    0.90
Activations Density: 0.012%