Explanations
mentions of the name "Ruth" and related variations
Negative Logits
owie          -0.17
iffin         -0.17
ì¶            -0.16
ifton         -0.16
eur           -0.15
ez            -0.15
sovereignty   -0.15
θεν           -0.14
elho          -0.14
.Internal     -0.14
Positive Logits
lessly        0.29
lessness      0.22
ie            0.20
less          0.20
ledge         0.19
ann           0.19
fully         0.19
fulness       0.17
rough         0.17
quake         0.17
Activations Density: 0.005%