INDEX
Explanations
terms related to politics and government
the character 'Ļ'
Negative Logits
gist             -0.65
promoters        -0.64
anium            -0.62
imitation        -0.61
odium            -0.57
sophistication   -0.57
raviolet         -0.57
range            -0.57
eanor            -0.57
loopholes        -0.56
Positive Logits
Ļ      1.29
女     1.18
Ľ      1.02
ļ      1.00
ķ      0.99
ħ      0.99
çIJ    0.94
ı      0.94
º      0.92
İ      0.92
Activations Density 0.537%