INDEX
Explanations
verbs that indicate actions or processes
New Auto-Interp
Negative Logits
存于互联网档案馆
-0.63
houſe
-0.59
fubject
-0.59
pleaſure
-0.57
ſever
-0.57
perſon
-0.54
ſtate
-0.54
Houſe
-0.52
ſub
-0.52
Majefty
-0.51
POSITIVE LOGITS
s
0.79
es
0.71
للمعارف
0.68
roaches
0.68
kes
0.68
zes
0.67
otes
0.66
ixes
0.66
odes
0.66
ks
0.66
Activations Density 0.672%