Explanations
references to the concept of evolution
Negative Logits
+#+#             -0.81
masculinos       -0.71
nesty            -0.70
fraî             -0.69
Hopf             -0.68
engraçadas       -0.66
Jurí             -0.65
poitrine         -0.65
IntoConstraints  -0.65
Gegend           -0.65
Positive Logits
evolution        2.03
evolve           1.86
evolution        1.83
Evolution        1.77
EVOLUTION        1.70
Evolution        1.65
evolved          1.62
evolves          1.60
evolving         1.59
evolu            1.57
Activations Density 0.168%