Explanations
military-related terms and ranks
Negative Logits
anca       -0.16
steen      -0.15
reins      -0.15
zdy        -0.15
ladu       -0.15
udo        -0.14
chwitz     -0.14
zsche      -0.14
ynamodb    -0.14
ardo       -0.13
Positive Logits
Fur        0.15
abe        0.15
Granite    0.14
572        0.14
Sawyer     0.13
è¥         0.13
-equipped  0.13
146        0.13
111        0.13
ouz        0.13
Activations Density 0.033%