INDEX
Explanations
words or phrases related to events or actions coming to a conclusion
instances of the word "end" and its variations, indicating conclusions or terminations
New Auto-Interp
Negative Logits
hee
-0.68
broom
-0.61
esome
-0.61
onne
-0.57
Parables
-0.56
rooft
-0.55
reused
-0.55
minecraft
-0.55
icipated
-0.54
squared
-0.54
POSITIVE LOGITS
prematurely
1.12
angering
1.09
abruptly
1.07
tragically
0.96
owment
0.87
peacefully
0.86
angers
0.84
ear
0.81
orses
0.79
eared
0.78
Activations Density 0.046%