INDEX
Explanations
mentions of military-related terms
words related to military forces and activities
New Auto-Interp
Negative Logits
Mist
-0.76
âĸ¬
-0.76
BOOK
-0.74
coli
-0.74
Bake
-0.68
Lot
-0.68
Rate
-0.63
··
-0.63
clipse
-0.62
PER
-0.62
POSITIVE LOGITS
arily
1.15
ament
1.02
iam
0.98
ications
0.93
ician
0.93
antly
0.90
ancy
0.89
ategic
0.87
rior
0.86
milit
0.85
Activations Density 0.011%