INDEX
Explanations
text related to politics and international relations
New Auto-Interp
Negative Logits
seiz
-0.74
myster
-0.73
scattering
-0.71
sacrific
-0.69
Mobil
-0.64
Tanz
-0.63
achus
-0.62
Negro
-0.61
notor
-0.61
federation
-0.61
POSITIVE LOGITS
ï¸ı
1.10
¯
0.94
ttle
0.88
s
0.84
payer
0.76
mental
0.76
ti
0.72
endif
0.71
tal
0.70
tarian
0.70
Activations Density 1.170%