INDEX
Explanations
mentions of someone being alive
occurrences of the word "alive."
New Auto-Interp
Negative Logits
pled
-0.71
ple
-0.71
RECT
-0.69
ples
-0.63
agree
-0.61
ij
-0.61
Aerospace
-0.60
arov
-0.60
CHAT
-0.60
ãĥĩ
-0.60
POSITIVE LOGITS
abouts
0.97
lihood
0.92
alive
0.87
nces
0.81
lier
0.78
Alive
0.76
mares
0.74
hart
0.72
beat
0.69
guards
0.69
Activations Density 0.016%