INDEX
Explanations
proper nouns related to people, places, or entities
proper nouns, particularly names and brands
New Auto-Interp
Negative Logits
anwhile
-0.72
OPLE
-0.71
charge
-0.71
separatist
-0.70
ħĭ
-0.68
retaliate
-0.68
etheless
-0.67
ãĥ¼ãĥĨ
-0.67
depreciation
-0.66
grievance
-0.65
POSITIVE LOGITS
ona
1.06
isha
0.94
oya
0.93
amo
0.90
ado
0.90
leys
0.90
ley
0.89
inda
0.88
onda
0.87
udos
0.87
Activations Density 0.467%