Explanations
the word "Sin"
mentions of "Sin" in various contexts
Negative Logits
Token       Logit
è¦ļéĨĴ      -0.92
dropping    -0.78
ĵĺ          -0.78
enance      -0.76
¶ħ          -0.74
deficits    -0.71
downed      -0.68
lished      -0.67
sound       -0.67
orld        -0.66
Positive Logits
Token       Logit
ners        1.14
ned         1.02
clair       1.02
atra        0.98
estro       0.96
ister       0.94
uous        0.93
ews         0.91
ja          0.88
oma         0.87
Activations Density: 0.011%