INDEX
Explanations
words related to political topics and procedures
New Auto-Interp
Negative Logits
livion
-0.62
="/
-0.60
imposed
-0.60
tnc
-0.59
sic
-0.56
hers
-0.56
hyde
-0.56
alas
-0.56
latter
-0.55
@@
-0.54
POSITIVE LOGITS
cknowled
0.87
Definitions
0.80
Advice
0.73
Own
0.72
Disapp
0.69
Basics
0.69
Overview
0.69
Gets
0.68
Emails
0.68
Introduction
0.68
Activations Density 0.173%