INDEX
Explanations
proper names related to individuals
specific terms and names related to individuals and entities
New Auto-Interp
Negative Logits
ufact
-0.82
yip
-0.80
icipated
-0.80
icip
-0.75
emouth
-0.71
owship
-0.67
allery
-0.67
gobl
-0.67
achev
-0.65
isites
-0.65
POSITIVE LOGITS
Redditor
0.77
lyn
0.74
Jav
0.73
Allah
0.71
opian
0.65
Zar
0.64
266
0.64
Pacific
0.63
opolis
0.62
iba
0.61
Activations Density 0.297%