INDEX
Explanations
the word "life"
references to life, living, or vitality-related themes
New Auto-Interp
Negative Logits
chall
-0.75
PASS
-0.65
Sabha
-0.62
secret
-0.60
Franken
-0.59
Liberals
-0.59
Program
-0.59
Minnesota
-0.59
Marginal
-0.58
fine
-0.58
POSITIVE LOGITS
ife
1.16
yip
0.97
mite
0.88
yer
0.78
terness
0.77
llor
0.75
zie
0.75
ternity
0.75
lette
0.72
etus
0.72
Activations Density 0.013%