Explanations
screens or screenshots
instances of the word "screen" and its variations
Negative Logits
aternal       -0.73
doms          -0.67
istani        -0.65
ortium        -0.64
Scientific    -0.64
akeru         -0.64
shire         -0.63
Dangerous     -0.60
plur          -0.59
vernment      -0.59
Positive Logits
plays         1.27
grab          1.24
writer        1.16
writers       1.15
avers         1.11
caps          1.11
printed       1.11
writing       1.05
aver          1.03
casts         0.94
Activations Density 0.036%