Explanations
specific instructions or guidelines related to safety and care in emergency situations
Negative Logits
inward      -0.19
downhill    -0.18
krom        -0.15
onward      -0.15
cip         -0.14
ssi         -0.14
outward     -0.14
istik       -0.13
outgoing    -0.13
Decre       -0.13
Positive Logits
above       0.75
above       0.70
Above       0.69
ABOVE       0.67
Above       0.66
上          0.61
bove        0.60
upper       0.60
higher      0.58
_above      0.56
Activations Density: 0.415%
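
As a rough illustration of what an activation density figure like this usually means, the sketch below computes the fraction of token positions on which a feature is active. It assumes that density is defined as the share of sampled tokens with a nonzero activation; the page itself does not state the definition. The function name `activation_density`, the threshold, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled token positions on
    which the feature fires. The source page does not define it.
    """
    if not activations:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 415 of 100,000 token positions.
sample = [1.0] * 415 + [0.0] * 99_585
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.415% would mean the feature fires on roughly 4 of every 1,000 sampled tokens.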