INDEX
Explanations
geographical locations and identifiers related to conflict zones
New Auto-Interp
Negative Logits
Grateful
-0.68
crank
-0.67
sych
-0.67
Knight
-0.64
MPH
-0.64
ãĤ´ãĥ³
-0.64
Roberts
-0.64
Logged
-0.63
Weaver
-0.63
rawdownloadcloneembedreportprint
-0.63
POSITIVE LOGITS
ascus
0.99
ilan
0.92
awi
0.90
etsk
0.90
ussia
0.86
assies
0.85
uala
0.83
akh
0.83
unpop
0.82
abul
0.82
Activations Density 0.101%