INDEX
Explanations
phrases indicating an upcoming event
phrases indicating imminent actions or events
New Auto-Interp
Negative Logits
Compass
-0.74
Howe
-0.65
perspective
-0.65
Cooke
-0.64
Corpus
-0.63
raped
-0.62
gateway
-0.61
representations
-0.60
simulator
-0.60
Perspective
-0.59
POSITIVE LOGITS
pless
0.97
pload
0.91
NetMessage
0.89
arrive
0.83
be
0.80
announce
0.79
legalize
0.78
unveil
0.76
give
0.76
come
0.74
Activations Density 0.045%