Explanations
general words used with opinions
archaic or stylized English words
Negative Logits
          -0.56
__':      -0.51
__':      -0.50
WEBPACK   -0.50
])]       -0.48
lohnt     -0.47
karet     -0.46
llevaron  -0.46
pega      -0.46
          -0.45
Positive Logits
Efq      0.76
Theſe    0.74
houſe    0.74
Houſe    0.73
ſeveral  0.71
ſta      0.71
Jefus    0.71
ſtate    0.70
wiſe     0.69
myſelf   0.69
Activations Density 2.747%