Explanations
past-tense verbs that indicate actions or events
Negative Logits
á»ħ       -0.15
icut      -0.15
unzip     -0.15
ÃŃc       -0.14
resse     -0.14
gL        -0.14
LENG      -0.14
apol      -0.13
Roberto   -0.13
dress     -0.13
Positive Logits
aly       0.15
egal      0.15
iá»ĥn     0.15
chas      0.15
chan      0.14
enty      0.14
/goto     0.14
åł        0.14
chg       0.14
hlen      0.14
Activations Density 0.055%
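
Several tokens in the logit lists (for example á»ħ, ÃŃc, iá»ĥn and åł) look like the display strings of a byte-level BPE vocabulary rather than readable text. The sketch below shows one way to map such strings back to UTF-8, assuming the tokenizer uses the GPT-2 style byte-to-unicode table; the source does not name the tokenizer, so this is an assumption. The helper names `bytes_to_unicode` and `decode_bpe_token` are illustrative and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the GPT-2 style byte -> printable-character table.

    Assumption: the dashboard's tokenizer uses this mapping; the source
    does not say which tokenizer produced the tokens.
    """
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at or above 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: display character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_bpe_token(token: str) -> str:
    """Map a display-form token back to text (hypothetical helper).

    Incomplete UTF-8 sequences, which occur when a token holds only part
    of a multi-byte character, are rendered as U+FFFD.
    """
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["á»ħ", "ÃŃc", "iá»ĥn", "icut"]:
        print(repr(tok), "->", repr(decode_bpe_token(tok)))
```

Under that assumption, á»ħ decodes to ễ, ÃŃc to íc, and iá»ĥn to iển, while åł holds only the first two bytes of a three-byte character and so has no standalone text form.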