INDEX
Explanations
repetitive uses or mentions of the word "it."
New Auto-Interp
Negative Logits
araw
-0.81
Badger
-0.80
Monfieur
-0.76
Dede
-0.74
Badger
-0.72
Conſ
-0.71
andes
-0.71
PRS
-0.68
münchen
-0.67
noel
-0.66
POSITIVE LOGITS
it
1.54
It
1.39
+#+#
1.30
It
1.29
its
1.27
它
1.24
Its
1.19
它
1.18
IT
1.16
Its
1.16
Activations Density 0.345%