INDEX
Explanations
prepositions and articles around certain words
prepositions and phrases that indicate location or direction
New Auto-Interp
Negative Logits
icter
-0.78
quartered
-0.70
Entered
-0.62
dor
-0.62
ratulations
-0.59
ievers
-0.58
ctors
-0.57
enment
-0.57
ounter
-0.56
ereo
-0.55
POSITIVE LOGITS
ourselves
1.07
yourselves
1.03
myself
1.00
yourself
0.97
Ħ¢
0.73
anyway
0.71
herself
0.68
themselves
0.68
own
0.67
anyways
0.66
Activations Density 0.373%