INDEX
Explanations
mentions of religion, particularly in various contexts
New Auto-Interp
Negative Logits
Nichols
-0.75
sai
-0.62
வர்
-0.61
OutOfRange
-0.61
Tilt
-0.60
sucht
-0.59
sze
-0.59
ی
-0.58
{//-0.57
toimi
-0.57
POSITIVE LOGITS
itſelf
1.27
myſelf
1.09
Majefty
1.04
pleaſure
1.02
Jefus
1.02
fubject
0.98
theless
0.97
purpoſe
0.96
houſe
0.95
Efq
0.90
Activations Density 0.002%