INDEX
Explanations
dishonest, irresponsible, wrong
New Auto-Interp
Negative Logits
Needs
0.85
Needs
0.80
necesita
0.78
incubated
0.74
trouxe
0.72
necessidades
0.72
Bedürfnisse
0.71
brauchen
0.71
nascost
0.69
byly
0.69
POSITIVE LOGITS
tantamount
0.99
equivalent
0.97
equivalent
0.91
opposite
0.89
committing
0.85
opposite
0.82
denying
0.78
contradictory
0.76
противополо
0.75
مخالف
0.75
Activations Density 0.150%