INDEX
Explanations
the pronoun "it" used in various contexts
Negative Logits
iefs      -0.16
velle     -0.15
Latter    -0.15
apot      -0.14
á»iji     -0.14
edom      -0.14
ongan     -0.14
à¥ĩण     -0.13
lass      -0.13
essor     -0.13
Positive Logits
happen    0.25
clear     0.25
onto      0.24
known     0.22
possible  0.22
appen     0.21
count     0.20
past      0.20
official  0.20
easier    0.20
Activations Density 0.022%