INDEX
Explanations
web addresses and email addresses
sentences ending with a period
Negative Logits
basis          -0.64
architectural  -0.64
attraction     -0.62
ħĭ             -0.61
sadly          -0.60
backward       -0.60
backwards      -0.59
trophy         -0.59
onga           -0.58
cod            -0.58
Positive Logits
edu            1.05
com            0.93
tv             0.90
tumblr         0.83
(blank)        0.82
blogspot       0.82
php            0.81
co             0.80
push           0.80
net            0.79
Activations Density 0.111%