INDEX
Explanations
occurrences of the word "the" and its significance in text
New Auto-Interp
Negative Logits
wen
-0.85
theirs
-0.75
thood
-0.72
FILE
-0.71
cles
-0.70
bg
-0.67
iva
-0.67
owl
-0.67
NULL
-0.67
tumblr
-0.66
POSITIVE LOGITS
workings
1.09
latest
1.08
origins
1.06
complexities
1.06
beginnings
1.05
extent
1.03
basics
1.01
importance
1.01
finer
1.01
similarities
1.00
Activations Density 0.180%