INDEX
Explanations
references to Muammar Qaddafi and his regime
New Auto-Interp
Negative Logits
asal
-0.15
.ca
-0.15
agle
-0.15
anna
-0.15
ibus
-0.14
awl
-0.14
Gemini
-0.14
ãĥ©ãĥ¼
-0.14
704
-0.13
aks
-0.13
POSITIVE LOGITS
Yao
0.15
ischer
0.14
acey
0.14
Ø·Ùħ
0.14
obstacle
0.13
regimes
0.13
gravity
0.13
shima
0.13
alin
0.13
bott
0.13
Activations Density 0.055%