INDEX
Explanations
relative pronouns and their usage in sentences
New Auto-Interp
Negative Logits
ALA
-0.16
Abrams
-0.15
adio
-0.14
spots
-0.14
ossal
-0.14
onne
-0.13
Spoon
-0.13
Fisher
-0.13
Whites
-0.13
-spot
-0.13
POSITIVE LOGITS
šov
0.18
elage
0.15
ůj
0.15
ully
0.15
ÙħÛĮÙĦادÛĮ
0.15
ften
0.14
kop
0.14
ande
0.14
èıĮ
0.14
¥
0.14
Activations Density 0.026%