Explanations
website links and URLs
instances of ellipses or truncation in text
Negative Logits
entimes      -0.83
suspic       -0.81
wielded      -0.79
Lomb         -0.72
nodd         -0.68
tampering    -0.67
Ͻ            -0.67
diminishing  -0.67
irens        -0.66
choke        -0.66
Positive Logits
âĢİ          0.92
Done         0.85
=#           0.75
Delivery     0.73
prop         0.72
rock         0.72
itect        0.72
BRE          0.70
etc          0.70
abil         0.69
Activations Density 0.030%