Explanations
specific words related to the military and locations such as "Pike" and "Shot."
references to specific products or brands
Negative Logits
enegger    -0.82
schild     -0.81
enance     -0.76
binding    -0.71
crore      -0.70
orage      -0.69
NUM        -0.68
ieu        -0.68
naire      -0.67
IFIED      -0.67
Positive Logits
oples      1.18
ggy        1.09
pe         0.96
cies       0.93
anut       0.91
levard     0.90
achy       0.88
aky        0.88
utics      0.87
ck         0.87
Activations Density: 0.011%