INDEX
Explanations
phrases indicating increasing quantity or intensity
conjunctions and phrases indicating a relationship of addition or comparison
New Auto-Interp
Negative Logits
ij士
-0.73
RAW
-0.70
Peaks
-0.65
ãĤ´ãĥ³
-0.65
robber
-0.64
Sparrow
-0.64
Saud
-0.64
Blaz
-0.63
Bulgar
-0.63
Cros
-0.62
POSITIVE LOGITS
vous
0.87
ificantly
0.86
angled
0.78
than
0.77
than
0.76
efficient
0.76
acho
0.76
fficient
0.75
cientious
0.74
itably
0.74
Activations Density 0.141%