INDEX
Explanations
countries or regions
references to specific geographic locations and political entities
New Auto-Interp
Negative Logits
requisite
-0.60
cause
-0.60
into
-0.58
planet
-0.58
},"
-0.57
him
-0.56
aciously
-0.55
invade
-0.55
orio
-0.54
-+
-0.53
POSITIVE LOGITS
there
0.99
meanwhile
0.91
they
0.88
we
0.83
there
0.75
it
0.73
however
0.72
THERE
0.70
THEY
0.67
,
0.67
Activations Density 0.372%