INDEX
Explanations
phrases related to spiritual or religious expressions
Negative Logits
Fres      -0.16
emento    -0.16
.azure    -0.16
oten      -0.15
abbo      -0.15
Z         -0.14
interop   -0.14
inters    -0.14
urrect    -0.14
machine   -0.14
Positive Logits
dv        0.17
rog       0.16
Jag       0.15
Birth     0.15
rik       0.15
hti       0.15
abh       0.15
dbh       0.15
birth     0.14
rád       0.14
Activations Density: 0.020%