INDEX
Explanations
locations and organizations related to military and water
mentions of athletic events or athletes
Negative Logits
Token      Logit
walking    -0.70
ARM        -0.66
MG         -0.66
EAR        -0.65
ONG        -0.65
tarians    -0.64
Handler    -0.63
Counter    -0.63
loaded     -0.61
ocrates    -0.61
Positive Logits
Token      Logit
teenth     1.04
nces       0.94
itud       0.82
umar       0.81
uckland    0.77
ces        0.77
teen       0.76
itudes     0.75
itude      0.75
ngth       0.75
Activations Density: 0.034%