INDEX
Explanations
abbreviations related to organizations
abbreviations or acronyms related to organizations and associations
New Auto-Interp
Negative Logits
Redditor
-0.72
captcha
-0.68
unden
-0.67
tyr
-0.66
bour
-0.64
streng
-0.63
desper
-0.59
weap
-0.58
paralysis
-0.58
depress
-0.58
POSITIVE LOGITS
)'
1.43
)
1.37
),
1.33
)—
1.19
)-
1.15
)/
1.06
)[
1.05
),"
1.05
).
1.03
)!
1.01
Activations Density 0.069%