INDEX
Explanations
proper nouns and names associated with places or significant entities
New Auto-Interp
Negative Logits
pat
-0.63
et
-0.51
Pat
-0.50
quete
-0.50
s
-0.49
lecht
-0.49
v
-0.49
江湖
-0.48
Require
-0.48
dot
-0.48
POSITIVE LOGITS
pleaſure
0.91
Majefty
0.87
houſe
0.85
ſever
0.81
Efq
0.81
myſelf
0.80
themſelves
0.80
itſelf
0.79
leſs
0.78
ſtate
0.77
Activations Density 0.155%