INDEX
Explanations
information related to political figures and events
New Auto-Interp
Negative Logits
MAS
-0.84
swick
-0.83
MAP
-0.79
ACA
-0.75
Nap
-0.75
STEP
-0.74
Times
-0.72
efe
-0.71
STAT
-0.70
ç«
-0.70
POSITIVE LOGITS
possible
0.86
regards
0.85
practicable
0.83
contacting
0.71
outsiders
0.71
replacements
0.69
localization
0.69
respecting
0.68
anyone
0.68
interpreting
0.66
Activations Density 0.022%