INDEX
Explanations
phrases indicating a lack or absence of something
New Auto-Interp
Negative Logits
ricultural
-0.51
DataAnnotations
-0.50
eduardo
-0.50
megane
-0.49
Edu
-0.48
Oste
-0.47
Alfa
-0.47
Alfa
-0.47
Stewart
-0.46
ǜ
-0.46
POSITIVE LOGITS
None
1.34
none
1.34
none
1.31
None
1.31
NONE
1.08
NONE
1.03
ninguno
0.93
neither
0.69
nessuno
0.66
neither
0.65
Activations Density 0.114%