INDEX
Explanations
the phrase "between" followed by numerical ranges
New Auto-Interp
Negative Logits
OGR
-0.87
gow
-0.75
rav
-0.74
eto
-0.73
vous
-0.70
jew
-0.66
unfocusedRange
-0.63
aptic
-0.63
atically
-0.62
quished
-0.61
POSITIVE LOGITS
halves
0.81
sexes
0.76
genders
0.74
bouts
0.66
incomes
0.65
1945
0.63
midnight
0.63
1910
0.62
stocks
0.62
extremes
0.62
Activations Density 0.027%