INDEX
Explanations
references to spatial relationships involving the word "next"
instances of the word "next."
New Auto-Interp
Negative Logits
lee
-0.70
Feldman
-0.66
ocker
-0.65
bach
-0.64
Legions
-0.64
lees
-0.63
hist
-0.63
Hebdo
-0.63
Sinai
-0.62
ribution
-0.62
POSITIVE LOGITS
millenn
0.95
etheless
0.91
week
0.89
step
0.87
ĻĤ
0.85
srf
0.82
installment
0.80
neighb
0.79
morning
0.79
generation
0.78
Activations Density 0.040%