INDEX
Explanations
verbs related to actions or abilities
indications of actions or states relating to performance or capabilities
New Auto-Interp
Negative Logits
ð
-0.75
bush
-0.66
Verse
-0.66
mpire
-0.65
thal
-0.65
whe
-0.63
conn
-0.63
ire
-0.63
pal
-0.63
ry
-0.61
POSITIVE LOGITS
moreover
1.24
furthermore
1.22
also
1.18
therefore
1.06
however
0.99
meanwhile
0.91
additionally
0.90
certainly
0.85
thus
0.79
also
0.76
Activations Density 0.531%