INDEX
Explanations
terms and phrases related to informal language forms and euphemisms
New Auto-Interp
Negative Logits
dinand
-0.77
olphin
-0.75
gio
-0.71
hardt
-0.69
itor
-0.69
Tickets
-0.67
Democr
-0.66
negie
-0.66
galitarian
-0.65
Enlight
-0.64
POSITIVE LOGITS
shorthand
1.12
terminology
0.96
phrases
0.92
jargon
0.92
phrase
0.89
coined
0.88
isms
0.84
language
0.83
diction
0.82
term
0.82
Activations Density 0.015%