Explanations
references to alcoholic beverages and their effects
Negative Logits
å¹²      -0.16
avra     -0.15
álo      -0.15
INO      -0.14
gifs     -0.14
оÑı      -0.14
enco     -0.14
)âĢı     -0.14
ìĦł      -0.14
olik     -0.14
Positive Logits
Genesis  0.32
Acts     0.31
Romans   0.29
Genesis  0.29
Ps       0.29
Acts     0.28
Matthew  0.28
Numbers  0.27
Luke     0.27
Judges   0.27
Activation Density: 0.350%