INDEX
Explanations
words related to political and social conflicts
New Auto-Interp
Negative Logits
âĺħâĺħ
-0.71
Bethesda
-0.62
FI
-0.62
Shades
-0.61
ja
-0.59
fry
-0.59
ãģŁ
-0.59
perature
-0.59
ochet
-0.58
potatoes
-0.58
POSITIVE LOGITS
xon
1.28
xus
1.22
seed
1.03
avier
0.98
illary
0.97
posure
0.95
angel
0.91
es
0.91
endale
0.90
xes
0.90
Activations Density 0.019%