INDEX
Explanations
numeric patterns in the format "number-number" indicating a range of values
phrases related to numerical scales or ratings
New Auto-Interp
Negative Logits
Mechdragon
-0.69
cheated
-0.65
solicitation
-0.64
SIG
-0.63
lax
-0.59
Canary
-0.58
ãģ®å®
-0.58
BCC
-0.58
Rolls
-0.57
Toro
-0.57
POSITIVE LOGITS
hour
0.88
four
0.83
month
0.83
three
0.83
seven
0.83
hours
0.78
equal
0.78
quart
0.78
abit
0.77
poons
0.77
Activations Density 0.068%