INDEX
Explanations
phrases indicating proximity or nearness
repeated phrases indicating proximity or closeness
New Auto-Interp
Negative Logits
birds
-0.65
behav
-0.65
bet
-0.63
!!!!!!!!
-0.62
apo
-0.62
uploads
-0.60
NS
-0.59
nep
-0.59
RAW
-0.58
corrid
-0.58
POSITIVE LOGITS
sighted
0.74
heels
0.71
scrutiny
0.68
entin
0.68
sidx
0.66
othal
0.66
izabeth
0.65
amaru
0.65
ricular
0.65
shave
0.65
Activations Density 0.117%