INDEX
Explanations
proper nouns or names, specifically those related to the name "Rein."
references to specific names or individuals, particularly "Rein" and "Evangelicals."
New Auto-Interp
Negative Logits
Seym
-0.78
tub
-0.77
ategory
-0.71
creen
-0.70
milo
-0.70
mileage
-0.66
ombat
-0.65
recy
-0.65
estial
-0.64
bish
-0.64
POSITIVE LOGITS
forcement
1.14
hardt
1.08
acher
0.96
thal
0.92
forcer
0.92
vention
0.89
etta
0.84
Rein
0.81
ception
0.81
forced
0.79
Activations Density 0.017%