INDEX
Explanations
phrases related to medical conditions or natural disasters
terms related to diagnosis and medical conditions
New Auto-Interp
Negative Logits
ting
-0.91
ted
-0.82
ters
-0.80
TING
-0.78
BOOK
-0.76
DOM
-0.73
FORE
-0.72
BOX
-0.72
ween
-0.71
ter
-0.71
POSITIVE LOGITS
ostic
1.41
ostics
1.17
ificent
1.09
ificant
1.04
itude
0.97
animous
0.93
osis
0.87
ados
0.87
olini
0.86
ancy
0.83
Activations Density 0.015%