INDEX
Explanations
phrases or sentences indicating an end or completion
repeated mentions of the word "over."
Negative Logits
osity         -0.75
Forward       -0.68
resy          -0.63
ãĥı           -0.63
associates    -0.62
partName      -0.61
yssey         -0.60
Reference     -0.59
Galactic      -0.58
Readers       -0.58
POSITIVE LOGITS
kill          1.20
blown         1.16
rated         1.08
priced        1.06
stated        1.05
reaching      1.02
whelming      0.98
valued        0.97
drive         0.96
loading       0.94
Activations Density 0.030%