INDEX
Explanations
elements related to identity and relationships
New Auto-Interp
Negative Logits
.
-0.94
,
-0.73
;
-0.68
:
-0.56
。
-0.53
—
-0.50
(
-0.50
↵↵
-0.49
–
-0.49
…
-0.47
POSITIVE LOGITS
itſelf
1.10
RectangleBorder
1.02
MainAxisSize
1.01
Theſe
0.99
―――――
0.96
་་
0.96
setVerticalGroup
0.95
ProtoMessage
0.91
IsContent
0.91
httphttps
0.91
Activations Density 1.103%