INDEX
Explanations
references to personal pronouns and their usage in emotional contexts
New Auto-Interp
Negative Logits
Vid
-0.17
oret
-0.16
certainly
-0.15
nÃŃk
-0.15
convex
-0.15
Hurt
-0.14
vid
-0.14
commercials
-0.14
onne
-0.14
atre
-0.14
POSITIVE LOGITS
/us
0.20
azo
0.16
åĢij
0.15
amet
0.15
itas
0.15
ocol
0.14
({...0.13
_simps
0.13
hết
0.13
fol
0.13
Activations Density 0.289%