INDEX
Explanations
instances of the word "the" along with language that compares or contrasts different subjects
New Auto-Interp
Negative Logits
1
-0.16
01
-0.14
Talk
-0.14
dynasty
-0.13
UIWindow
-0.13
461
-0.13
orem
-0.13
lias
-0.13
989
-0.13
100
-0.13
POSITIVE LOGITS
ones
0.28
similarly
0.24
others
0.22
earlier
0.19
originals
0.18
others
0.17
previous
0.17
'autres
0.16
decess
0.16
éĤ£äºĽ
0.16
Activations Density 0.135%