INDEX
Explanations
mentions of military contexts or topics
military domains and applications
New Auto-Interp
Negative Logits
Anlass
-0.60
jaqueta
-0.60
kohta
-0.58
doğum
-0.57
Câu
-0.57
querida
-0.56
rêves
-0.55
rubrique
-0.54
tonode
-0.54
Pons
-0.54
POSITIVE LOGITS
Military
0.84
military
0.82
Military
0.80
military
0.76
MILITARY
0.75
militar
0.73
Militar
0.62
Milit
0.55
ilitary
0.55
militari
0.51
Activations Density 0.006%