INDEX
Explanations
periods at the end of sentences
New Auto-Interp
Negative Logits
charism
-0.69
clipboard
-0.68
glim
-0.66
cientious
-0.64
glasses
-0.62
Hitman
-0.60
Stoke
-0.59
Shinra
-0.59
mund
-0.58
slate
-0.57
POSITIVE LOGITS
ctuary
0.72
jong
0.72
lopp
0.71
%%
0.67
Authors
0.67
vae
0.67
Spons
0.67
tm
0.66
jas
0.66
aneously
0.65
Activations Density 0.244%