INDEX
Explanations
instances of words related to requests or desires
references to requests or appeals for something
New Auto-Interp
Negative Logits
dule
-0.67
Chop
-0.64
Kelley
-0.64
Insurance
-0.62
sshd
-0.62
Accounting
-0.60
Frankfurt
-0.60
Rocket
-0.59
Junk
-0.59
Prospect
-0.58
POSITIVE LOGITS
pleas
1.34
urable
1.32
ured
0.95
ĸļ
0.95
icular
0.94
geoning
0.94
izu
0.88
gon
0.88
uit
0.86
alty
0.84
Activations Density 0.009%