INDEX
Explanations
attends to abstract or conceptual tokens from more specific, concrete tokens related to actions or states
New Auto-Interp
Head Attr Weights
0:0.07
1:0.09
2:0.13
3:0.08
4:0.07
5:0.02
6:0.19
7:0.31
Negative Logits
ویکیپدیا
-0.24
ferd
-0.23
año
-0.22
Rockefeller
-0.22
يتيمه
-0.22
same
-0.22
ViewStyle
-0.21
location
-0.21
PageIndex
-0.21
AttributeSet
-0.21
POSITIVE LOGITS
NUMX
0.41
はじめに
0.36
помним
0.35
%?
0.35
?')
0.34
?>
0.34
Bref
0.34
leſs
0.34
»?
0.33
fieldNum
0.33
Activations Density 0.351%