INDEX
Explanations
phrases related to medical emergencies or conditions posing a significant risk to life
references to life-threatening conditions or injuries
Negative Logits
ulhu        -0.92
Ces         -0.75
Cree        -0.72
BLIC        -0.68
Carnegie    -0.65
Ell         -0.64
Cotton      -0.63
ño          -0.62
aco         -0.62
edo         -0.60
Positive Logits
saving       1.30
cycle        1.12
sized        1.07
cycles       1.06
consuming    1.01
tested       0.99
changing     0.98
eating       0.98
loving       0.96
intensive    0.94
Activations Density 0.042%
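
For orientation, a density figure like the one above is commonly read as the fraction of sampled tokens on which the feature fires. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample data are assumptions made for illustration and are not taken from this dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active. The dashboard's exact definition is not stated here.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 4 of which activate the feature.
sample = [0.0] * 9996 + [1.7, 0.3, 2.4, 0.9]
density = activation_density(sample)

# Format as a percentage, matching the dashboard's presentation style.
print(f"Activations Density {density:.3%}")
```

Under this reading, a density of 0.042% would mean the feature is active on roughly 4 of every 10,000 sampled tokens, which fits a narrow feature tied to life-threatening conditions.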