INDEX
Explanations
information or details following the abbreviation "i.e."
punctuation marks, particularly periods
Negative Logits
itiz          -0.74
istar         -0.65
polyg         -0.64
orn           -0.61
opportunity   -0.61
rall          -0.61
ima           -0.60
iens          -0.60
outreach      -0.59
burglary      -0.57
Positive Logits
wikipedia     0.83
-)            0.76
asp           0.76
psc           0.76
sbm           0.74
+)            0.74
jpg           0.74
large         0.74
suff          0.73
Downloadha    0.72
Activations Density: 0.047%