INDEX
Explanations
phrases related to legal and military contexts
statements or opinions regarding a person's actions or decisions
New Auto-Interp
Negative Logits
whip
-0.87
XM
-0.84
pony
-0.83
Pence
-0.83
ponies
-0.82
ince
-0.82
Mississ
-0.79
anch
-0.75
Sha
-0.75
ho
-0.74
POSITIVE LOGITS
Ber
2.44
Ber
2.35
Berger
1.98
Ger
1.64
Ger
1.59
Berg
1.54
Bern
1.51
Bern
1.50
ber
1.31
Bert
1.30
Activations Density 0.178%