INDEX
Explanations
locations or places
phrases related to arrests and legal situations
New Auto-Interp
Negative Logits
¥µ
-0.69
catentry
-0.67
He
-0.66
he
-0.66
ħĭ
-0.64
ython
-0.63
his
-0.62
Ô
-0.61
inav
-0.61
hes
-0.61
POSITIVE LOGITS
respectively
0.84
Their
0.81
Both
0.80
Their
0.79
Fleming
0.77
Nichols
0.77
Goodwin
0.74
Payne
0.74
Ms
0.74
trio
0.74
Activations Density 0.572%