INDEX
Explanations
the word "Loretta" followed by various surnames
the presence of specific names or terms associated with individuals, particularly in a legal or governmental context
New Auto-Interp
Negative Logits
nown
-0.81
urat
-0.73
reau
-0.71
gart
-0.68
tempting
-0.68
reimburse
-0.66
odder
-0.62
axter
-0.62
iago
-0.62
rolet
-0.61
POSITIVE LOGITS
士
0.72
scope
0.66
vironment
0.64
åĬ
0.64
ktop
0.64
atural
0.63
Arabian
0.61
esis
0.61
Strauss
0.61
iae
0.59
Activations Density 0.173%