Explanations
phrases containing words related to changes or actions, often with a consequential tone
commas in the text
Negative Logits
¬¼      -0.66
ulz     -0.64
antz    -0.64
grain   -0.61
hoe     -0.60
oons    -0.60
corn    -0.59
asses   -0.58
cott    -0.58
ge      -0.57
Positive Logits
albeit         1.20
namely         1.00
although       0.86
however        0.84
though         0.83
according      0.80
respectively   0.79
regardless     0.79
including      0.79
barring        0.77
Activations Density: 0.492%