INDEX
Explanations
references to therapy-related topics, particularly those concerning emotional well-being and mental health
New Auto-Interp
Negative Logits
opak
-0.15
icles
-0.14
Lamb
-0.14
HWND
-0.14
inea
-0.14
698
-0.14
_BATCH
-0.14
baptized
-0.13
UPS
-0.13
UPI
-0.13
POSITIVE LOGITS
Mast
0.17
AXIS
0.16
odor
0.16
-NLS
0.14
Sport
0.14
Ń
0.14
lyph
0.14
Maher
0.14
ession
0.14
anger
0.14
Activations Density 0.135%