INDEX
Explanations
specific information about different categories or entities, focusing on comparisons or rankings
repeated instances of the word "each."
New Auto-Interp
Negative Logits
bows
-0.73
stones
-0.73
ahs
-0.72
mares
-0.72
children
-0.70
haven
-0.70
models
-0.69
boys
-0.69
ELF
-0.69
lights
-0.68
POSITIVE LOGITS
successive
1.30
respective
1.11
iteration
1.10
individual
1.05
element
0.95
succeeding
0.93
dimension
0.92
participant
0.91
ingredient
0.90
facet
0.88
Activations Density 0.055%