Explanations
references to soldiers and military-related content
Negative Logits
Token          Logit
TERN           -0.78
omial          -0.72
âĶĢâĶĢâĶĢâĶĢ   -0.70
aminer         -0.66
CAST           -0.63
tera           -0.62
srfAttach      -0.60
Solution       -0.60
âĶģ            -0.58
SOURCE         -0.57
Positive Logits
Token          Logit
stationed      0.97
fatig          0.85
uniforms       0.73
barracks       0.73
garrison       0.72
doms           0.66
guarding       0.66
soldiers       0.66
reserv         0.65
colonel        0.64
Activations Density 7.825%