INDEX
Explanations
mentions of media personalities and news events
instances of the special character 'âĢ'
New Auto-Interp
Negative Logits
anwhile
-0.72
segreg
-0.61
Sic
-0.60
Franch
-0.58
scattering
-0.57
Constantin
-0.57
psychiat
-0.57
Afric
-0.56
Mous
-0.56
Tanz
-0.56
POSITIVE LOGITS
¬
1.13
ľ
1.13
Ń
1.02
Ķ
0.97
¦
0.96
¡
0.96
ĺ
0.95
º
0.95
£
0.93
ķ
0.92
Activations Density 0.308%