INDEX
Explanations
conjunctions indicating conditional statements or hypothetical scenarios
New Auto-Interp
Head Attr Weights
0:0.07
1:0.07
2:0.07
3:0.09
4:0.09
5:0.08
6:0.10
7:0.07
8:0.08
9:0.09
10:0.08
11:0.07
Negative Logits
Lucy
-3.05
Chloe
-3.03
Que
-3.02
elight
-2.84
Cloak
-2.73
Roz
-2.70
Blink
-2.70
Bailey
-2.68
Pick
-2.68
Show
-2.62
POSITIVE LOGITS
AMD
3.27
Motorola
3.15
chnology
2.96
Battery
2.94
overcl
2.83
conom
2.79
orsche
2.78
mining
2.72
edom
2.69
gew
2.69
Activations Density 0.000%