INDEX
Explanations
references to the concept of life and individual's experiences
New Auto-Interp
Negative Logits
Schot
-0.69
buttons
-0.61
<bos>
-0.59
Maldonado
-0.57
zuki
-0.57
Schult
-0.57
temptations
-0.56
obstacles
-0.56
Schmidt
-0.54
Kapp
-0.54
POSITIVE LOGITS
lives
1.77
Lives
1.72
lives
1.63
Lives
1.62
LIVES
1.55
vidas
1.18
lived
0.81
lived
0.81
życia
0.78
Live
0.77
Activations Density 0.003%