INDEX
Explanations
references to emotional states and struggles in contexts related to trauma, attachment, and mental health
New Auto-Interp
Negative Logits
papild
-0.53
nė
-0.52
linho
-0.51
esteja
-0.50
darb
-0.50
izacin
-0.50
сло
-0.49
panty
-0.49
auquel
-0.48
Kesehatan
-0.48
POSITIVE LOGITS
etc
1.13
etc
0.91
kasarigan
0.84
समीक्षक
0.82
!("{0.75
Etc
0.75
usw
0.74
그리고
0.72
kháu
0.69
आदि
0.67
Activations Density 0.355%