INDEX
Explanations
terms indicating specificity and distinctiveness in contexts, particularly related to personal or organizational responsibility
New Auto-Interp
Head Attr Weights
0:0.01
1:0.08
2:0.21
3:0.03
4:0.01
5:0.03
6:0.12
7:0.09
8:0.10
9:0.13
10:0.08
11:0.06
Negative Logits
existed
-0.89
MT
-0.85
eh
-0.85
GP
-0.83
Cause
-0.82
Verse
-0.82
exists
-0.82
blacks
-0.82
Comments
-0.80
ethic
-0.79
POSITIVE LOGITS
Reloaded
0.99
ongo
0.99
ディ
0.95
latest
0.94
versely
0.93
コ
0.92
inky
0.91
accordingly
0.91
Rated
0.91
��
0.91
Activations Density 0.212%