INDEX
Explanations
mentions of literature or literary-related terms
references to literary concepts and discussions
New Auto-Interp
Negative Logits
lain
-1.02
imal
-0.86
Downloadha
-0.85
lessly
-0.83
cellent
-0.80
alos
-0.78
ned
-0.78
etts
-0.75
ional
-0.75
nered
-0.74
POSITIVE LOGITS
significance
0.91
curiosity
0.85
heritage
0.84
pecul
0.82
prowess
0.82
performances
0.81
achievement
0.81
inspiration
0.80
excellence
0.79
liberties
0.79
Activations Density 0.052%