INDEX
Explanations
instances of text referring to public or online postings
instances of the word "posted."
New Auto-Interp
Negative Logits
kos
-0.71
glers
-0.69
Brill
-0.67
Gand
-0.65
Iv
-0.64
osi
-0.64
Flavoring
-0.64
Nebula
-0.63
Galile
-0.61
cest
-0.61
POSITIVE LOGITS
hum
0.96
ulate
0.93
postings
0.86
prominently
0.80
online
0.78
mortem
0.78
pics
0.78
onymous
0.77
posted
0.77
gres
0.76
Activations Density 0.037%