INDEX
Explanations
references to the concept of "more" or increases in quantity
New Auto-Interp
Negative Logits
SNL
-0.73
(&:
-0.73
bufio
-0.71
Caballero
-0.70
️
-0.70
❥
-0.69
Cluj
-0.69
__()
-0.68
africana
-0.66
Discografia
-0.66
POSITIVE LOGITS
more
1.77
MORE
1.60
more
1.53
More
1.41
More
1.41
MORE
1.41
Moreno
1.22
Moreira
1.13
Moreau
1.12
emore
1.12
Activations Density 0.147%