INDEX
Explanations
verbs ending in 'ing' or adjectives ending in 'ack', 'aps', 'ep', 'ed', 'ing', or 'unk'
colloquial or informal expressions of negative interactions or sentiments
New Auto-Interp
Negative Logits
Colo
-0.66
cryst
-0.62
SLI
-0.61
OTOS
-0.61
IJ
-0.59
Pa
-0.58
izoph
-0.58
SW
-0.58
URA
-0.57
CRE
-0.57
POSITIVE LOGITS
creen
0.93
iness
0.93
weed
0.91
ucker
0.90
buck
0.90
ily
0.89
ishly
0.88
manship
0.87
havoc
0.84
stakes
0.84
Activations Density 0.497%