INDEX
Explanations
phrases related to specific states or situations
references to specific conditions or criteria
New Auto-Interp
Negative Logits
sonian
-0.73
endar
-0.69
enza
-0.68
ring
-0.67
enic
-0.65
LS
-0.63
ritz
-0.63
adesh
-0.62
cart
-0.61
ashington
-0.61
POSITIVE LOGITS
conditions
0.95
ality
0.95
quo
0.93
deterior
0.93
condition
0.88
worsened
0.87
Conditions
0.87
aldehyde
0.84
permitting
0.84
deteriorated
0.80
Activations Density 0.045%