INDEX
Explanations
content related to citizenship status and legal rights
New Auto-Interp
Negative Logits
TagMode
-0.57
<<<<<<<<<<<<<<
-0.56
aarrggbb
-0.56
fapt
-0.56
Insee
-0.56
ſtate
-0.55
Chham
-0.55
ValueStyle
-0.54
lgari
-0.54
ſeveral
-0.54
POSITIVE LOGITS
tampoco
0.77
neither
0.65
Neither
0.63
neither
0.63
Neither
0.61
也不能
0.59
nor
0.58
too
0.55
obvious
0.55
都不能
0.53
Activations Density 0.552%