INDEX
Explanations
references to the concept of "life."
New Auto-Interp
Negative Logits
idéia
-0.50
ientras
-0.49
utuhkan
-0.49
Modelos
-0.43
للمعارف
-0.42
envolvimento
-0.42
asegurado
-0.42
khuy
-0.41
HostException
-0.41
Aún
-0.41
POSITIVE LOGITS
life
0.65
Life
0.62
life
0.61
生活
0.61
vida
0.60
Life
0.59
LIFE
0.50
Leben
0.50
Living
0.48
living
0.47
Activations Density 0.025%