INDEX
Explanations
authors of various written works
phrases indicating authorship and the number of works by different authors
New Auto-Interp
Negative Logits
Enlarge
-0.71
MSN
-0.65
Fuj
-0.62
magnification
-0.60
grouping
-0.60
broom
-0.60
camer
-0.58
pring
-0.58
loosen
-0.57
limitation
-0.57
POSITIVE LOGITS
itatively
0.79
books
0.77
letters
0.75
memoir
0.74
Awakens
0.73
blogs
0.71
Surv
0.69
novels
0.68
essays
0.67
poems
0.66
Activations Density 0.072%