INDEX
Negative Logits
Celebrity
-0.10
Celebrity
-0.09
astrology
-0.09
기간
-0.09
Investor
-0.09
Astrology
-0.08
Lamborghini
-0.08
lyrics
-0.08
reconciliation
-0.08
divorce
-0.08
POSITIVE LOGITS
neighbors
0.16
neighboring
0.16
adjacent
0.15
neighbor
0.15
adjacency
0.14
Adjacent
0.14
Neighbors
0.14
neighbors
0.13
neigh
0.13
lattice
0.13
Activations Density 0.030%