INDEX
Explanations
instances of the word "for" used in various contexts
New Auto-Interp
Negative Logits
LIST
-0.14
olon
-0.14
lices
-0.14
iversit
-0.14
lice
-0.14
má
-0.14
št
-0.14
conj
-0.13
Ents
-0.13
iterals
-0.13
POSITIVE LOGITS
sea
0.17
GIVEN
0.16
ours
0.16
ibbon
0.16
üzel
0.16
ayette
0.15
ourn
0.15
è¿İ
0.15
ôte
0.14
문
0.14
Activations Density 0.091%