INDEX
Explanations
references to tea and coffee drinks
Negative Logits
'\\;'            -0.89
uxxxx            -0.79
]")]             -0.69
UNCIL            -0.68
GenerationType   -0.67
ⓧ                -0.65
orteur           -0.64
AssemblyTitle    -0.64
الحره            -0.63
ModelExpression  -0.62
Positive Logits
drinks      1.17
coffee      1.14
tea         1.12
drink       1.11
drink       0.99
coffee      0.98
beverages   0.95
cappuccino  0.94
Drinks      0.94
Drinks      0.94
Activations Density: 0.242%