INDEX
Explanations
phrases emphasizing the importance of care and attentiveness towards oneself and others
giving care or protection
Negative Logits
attempt           -0.43
migrationBuilder  -0.42
sign              -0.39
experiment        -0.37
ⓧ                 -0.37
attempts          -0.36
Felton            -0.36
truy              -0.35
FromFile          -0.35
festival          -0.34
Positive Logits
Houſe        0.73
protect      0.68
Care         0.66
CARE         0.66
Caring       0.66
cuida        0.65
houſe        0.65
care         0.65
maintenance  0.63
cuidar       0.63
Activations Density 0.008%