Explanations
quotations and statements from individuals
Negative Logits
yani         -0.15
orsi         -0.14
ãĢģ“         -0.14
heim         -0.14
umo          -0.14
ught         -0.14
ourselves    -0.14
нами         -0.13
unseren      -0.13
yx           -0.13
Positive Logits
although     0.28
there        0.26
while        0.23
although     0.22
:            0.21
it           0.21
Although     0.20
while        0.18
despite      0.18
While        0.18
Activations Density 0.156%