INDEX
Explanations
phrases and terms related to caregiving
New Auto-Interp
Negative Logits
ondere
-0.18
ropol
-0.17
mith
-0.17
artin
-0.17
sher
-0.17
ski
-0.17
burgh
-0.16
rup
-0.16
swick
-0.16
sis
-0.16
POSITIVE LOGITS
fully
0.32
free
0.30
ening
0.25
full
0.25
lessly
0.25
ful
0.24
lessness
0.24
ering
0.22
ismatic
0.22
FULL
0.21
Activations Density 0.013%