INDEX
Explanations
phrases related to specific names, particularly those that seem to be associated with political or military figures
references to specific individuals, particularly those with the name "Naw" or "Daw."
New Auto-Interp
Negative Logits
Prism
-0.67
Juno
-0.67
Gabriel
-0.64
Purg
-0.62
grape
-0.60
ANGEL
-0.60
¯¯¯¯¯¯¯¯
-0.60
Pope
-0.59
Kaiser
-0.59
Sacrament
-0.59
POSITIVE LOGITS
lins
1.05
lat
1.03
dat
1.01
da
0.98
ez
0.96
ees
0.95
trak
0.94
dh
0.94
die
0.93
awi
0.93
Activations Density 0.052%