INDEX
Explanations
words related to political and economic discussions
New Auto-Interp
Negative Logits
Gutenberg
-0.64
essence
-0.61
dere
-0.59
Nadu
-0.58
pora
-0.57
Hoo
-0.56
Ceres
-0.56
coni
-0.55
por
-0.55
bilt
-0.55
POSITIVE LOGITS
odies
1.38
amboo
1.19
ibli
1.17
rief
1.16
rows
1.15
ruary
1.11
asket
1.10
isexual
1.07
ishops
1.07
acteria
1.05
Activations Density 2.814%