INDEX
Explanations
references to military operations and conflicts, particularly related to Iraq and Afghanistan
Negative Logits
lage          -0.18
Patron        -0.16
protection    -0.15
patron        -0.15
teaching      -0.14
Dig           -0.14
imo           -0.14
Dig           -0.14
ams           -0.14
Dude          -0.14
Positive Logits
Farrell         0.16
edback          0.15
Ot              0.15
.Abstractions   0.14
reib            0.14
ानन             0.14
alta            0.14
apes            0.14
근              0.14
近              0.14
Activations Density 0.023%