INDEX
Explanations
phrases related to negation
negative prefixes attached to adjectives or nouns
New Auto-Interp
Negative Logits
ħĭ
-0.91
ulhu
-0.88
Lumpur
-0.77
Dickinson
-0.76
Zup
-0.76
Cassidy
-0.75
ĸļ
-0.71
wagen
-0.69
Brus
-0.68
essee
-0.67
POSITIVE LOGITS
existent
1.27
profit
1.05
conscious
1.00
zero
0.99
issue
0.98
exclusive
0.98
human
0.97
issues
0.96
responsive
0.95
cont
0.94
Activations Density 0.034%