INDEX
Explanations
phrases indicating completion or conclusion
instances of the word "over" indicating completion or end of situations
Negative Logits
partName    -0.73
AMA         -0.71
Forward     -0.65
Express     -0.61
forwards    -0.61
yssey       -0.60
interacts   -0.60
osity       -0.60
ãĥı         -0.59
Likes       -0.59
Positive Logits
rated       1.11
priced      1.03
blown       1.03
stated      1.00
kill        1.00
reaching    0.98
haul        0.96
whelming    0.95
loading     0.95
whel        0.94
Activations Density 0.039%
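
As a rough illustration of what a density figure like this usually means, the sketch below assumes it is the fraction of sampled tokens on which the feature's activation is above zero. That definition, the helper name `activation_density`, and the sample data are assumptions made for illustration; none of them come from this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Hypothetical helper: assumes density = active tokens / total tokens.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Illustrative data only: 39 active tokens out of 100,000 sampled tokens.
sample = [1.0] * 39 + [0.0] * 99_961
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.039% would correspond to roughly 39 active tokens per 100,000 sampled.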