INDEX
Explanations
phrases related to warfare and military terms
references to warfare and military operations
Negative Logits
ablishment  -0.73
ogene       -0.73
apo         -0.73
real        -0.73
utch        -0.73
idem        -0.71
clair       -0.71
thia        -0.71
jamin       -0.70
aye         -0.70
Positive Logits
WAR         0.97
NER         0.97
RI          0.96
NING        0.95
MET         0.93
ER          0.89
CENT        0.88
FANTASY     0.87
RAG         0.87
FIELD       0.86
Activations Density: 0.006%