Explanations
references to societal structures and implications of privilege
Code, non-English words, or structural elements
size, length, width or shear
Negative Logits
这是一个    -0.60
là    -0.55
são    -0.54
является    -0.53
是个    -0.49
是一個    -0.49
เป็น    -0.49
είναι    -0.48
merupakan    -0.48
serem    -0.47
Positive Logits
nevertheless    0.89
twimg    0.76
nonetheless    0.76
HomeScreen    0.75
ConstraintMaker    0.74
deserve    0.74
SequentialGroup    0.71
useHistory    0.70
deserves    0.70
CURIAM    0.68
Activations Density: 0.308%