INDEX
Explanations
phrases and terms related to political dynamics and their implications on society
New Auto-Interp
Negative Logits
sekali
-0.40
Aholisi
-0.40
routs
-0.33
Universitario
-0.33
geïsole
-0.33
Vidite
-0.33
végétale
-0.33
intérieure
-0.32
muur
-0.32
trouw
-0.32
POSITIVE LOGITS
anymore
0.85
necessarily
0.67
__':
0.64
nor
0.64
المعيارى
0.60
ModelExpression
0.59
__":
0.59
zzleHttp
0.59
InjectAttribute
0.54
__':
0.53
Activations Density 1.534%