Explanations
words related to writing and bargaining
references to writing or written content
Negative Logits
Token        Logit
vain         -0.64
Idaho        -0.64
Scotia       -0.63
Peninsula    -0.63
motto        -0.63
Bravo        -0.63
odor         -0.62
ware         -0.62
Hale         -0.61
eldest       -0.61
Positive Logits
Token        Logit
es           1.21
terness      0.97
esh          0.94
hest         0.93
awa          0.93
ecake        0.93
hes          0.92
hou          0.88
egu          0.88
emen         0.87
Activations Density 0.033%