INDEX
Explanations
English, D, Adventure, Razor, Tot, Cong, Doll, Law
New Auto-Interp
Negative Logits
zetek
0.48
the
0.46
ates
0.45
ritic
0.44
to
0.43
éz
0.43
ulas
0.43
ewski
0.43
Watanabe
0.42
uj
0.41
POSITIVE LOGITS
finest
0.66
folly
0.57
niece
0.56
Finest
0.55
Choice
0.54
revenge
0.51
erster
0.49
birthday
0.49
Revenge
0.49
biographer
0.49
Activations Density 0.013%