INDEX
Negative Logits
thrott
-0.62
estate
-0.61
vered
-0.61
chops
-0.61
arers
-0.60
roph
-0.59
split
-0.59
orate
-0.59
ties
-0.58
Ń·
-0.58
POSITIVE LOGITS
pmwiki
0.96
ibliography
0.95
Sources
0.90
BOOK
0.87
âĨij
0.86
agascar
0.85
sites
0.81
Encyclopedia
0.78
Books
0.77
Sources
0.76
Activations Density 16.297%