INDEX
Explanations
phrases related to prayers and religious expressions
terms associated with beliefs and pleas for support
New Auto-Interp
Negative Logits
Mines
-0.83
xual
-0.82
olan
-0.76
izont
-0.73
mental
-0.71
neys
-0.70
ney
-0.70
lasses
-0.69
ohyd
-0.68
agle
-0.68
POSITIVE LOGITS
supp
0.78
acebook
0.78
-+-+-+-+
0.76
lication
0.74
prayers
0.74
enance
0.73
precaution
0.72
eb
0.70
cloth
0.69
assumption
0.65
Activations Density 0.037%