INDEX
Explanations
long verbs related to actions or decisions
verbs indicating actions or processes related to contributions or changes
New Auto-Interp
Negative Logits
grouping
-0.72
succeeding
-0.70
attaching
-0.68
unveiling
-0.67
requesting
-0.66
wounding
-0.66
cia
-0.66
bursting
-0.64
confir
-0.62
jet
-0.62
POSITIVE LOGITS
lass
0.80
redients
0.80
edge
0.66
GGGGGGGG
0.62
directions
0.60
tons
0.60
Edge
0.59
rods
0.59
irrelevant
0.59
allery
0.58
Activations Density 0.367%