INDEX
Explanations
concepts related to measurement and evaluation in various contexts
Head Attr Weights
0:0.01
1:0.01
2:0.17
3:0.10
4:0.23
5:0.03
6:0.04
7:0.14
8:0.04
9:0.04
10:0.08
11:0.05
Negative Logits
Congratulations   -1.69
eah               -1.67
Congratulations   -1.64
rats              -1.59
Flavoring         -1.54
yssey             -1.52
fell              -1.51
prus              -1.50
Joined            -1.49
�                 -1.48
Positive Logits
cruc              1.44
manageable        1.39
alloy             1.39
stroke            1.36
heel              1.35
サーティワン      1.34
practicable       1.33
appropriate       1.33
prescribed        1.32
":["              1.32
Activations Density 0.029%