INDEX
Explanations
imperative verbs or commands
Head Attr Weights
0:0.08
1:0.09
2:0.07
3:0.08
4:0.08
5:0.08
6:0.07
7:0.08
8:0.07
9:0.08
10:0.07
11:0.07
Negative Logits
Circus:-2.34
darts:-2.20
dared:-2.15
performing:-2.12
imitation:-2.09
Martian:-2.08
experiment:-2.05
constitu:-2.04
primates:-2.03
pupils:-2.00
Positive Logits
luaj:2.92
�:2.55
inventoryQuantity:2.39
リ:2.35
displayText:2.29
details:2.28
dow:2.27
isites:2.21
ィ:2.17
�:2.17
Activations Density 0.000%