INDEX
Explanations
phrases that convey contrasts or conflicts
New Auto-Interp
Negative Logits
valoración
-0.41
vectorielle
-0.41
juges
-0.41
userdetails
-0.39
↵
-0.39
licorne
-0.38
solidaridad
-0.38
testigo
-0.37
unicornio
-0.36
Asegúrese
-0.36
POSITIVE LOGITS
1.53
1.51
1.48
1.46
1.45
1.44
1.42
1.42
<0x17>
1.36
1.35
Activations Density 0.553%