INDEX
Explanations
phrases indicating a large amount or quantity
repetitive phrases emphasizing quantity or frequency
New Auto-Interp
Negative Logits
ħĭ
-0.90
ĸļ
-0.82
acus
-0.81
atis
-0.78
usp
-0.74
ule
-0.74
ful
-0.73
ĪĴ
-0.71
gypt
-0.69
acle
-0.68
POSITIVE LOGITS
times
0.92
things
0.85
fun
0.84
interesting
0.81
mileage
0.78
unanswered
0.76
misinformation
0.75
money
0.75
variability
0.74
stuff
0.73
Activations Density 0.086%