INDEX
Explanations
references to specific entities or concepts of interest
instances of the word "the" and assess their prevalence
Negative Logits
Discuss    -0.77
bj         -0.75
agues      -0.74
sburg      -0.72
buster     -0.71
Default    -0.71
Goal       -0.69
thood      -0.68
ndum       -0.68
wark       -0.67
Positive Logits
fact         1.86
sheer        1.39
amount       1.27
way          1.24
plethora     1.19
absence      1.16
multitude    1.15
abundance    1.15
extent       1.11
manner       1.10
Activations Density 0.304%