Explanations
expressions of possession or relationships
Negative Logits
Token           Logit
meriva          -0.72
lamella         -0.66
abbildung       -0.66
ignoire         -0.65
octaves         -0.64
betweenstory    -0.64
omnia           -0.63
jaya            -0.63
Hodgkin         -0.63
INSEE           -0.63
Positive Logits
Token           Logit
been            1.22
not             1.12
gonna           0.96
also            0.95
really          0.94
'])             0.93
"])             0.92
s               0.92
a               0.91
is              0.91
Activations Density: 0.199%