INDEX
Explanations
references to emotional or psychological support
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.07
4:0.08
5:0.07
6:0.09
7:0.06
8:0.08
9:0.08
10:0.08
11:0.09
Negative Logits
Alexa
-2.59
reliability
-2.49
Rel
-2.47
oper
-2.45
reliable
-2.42
Mitchell
-2.35
sonic
-2.34
sqor
-2.31
blasting
-2.31
Phantom
-2.31
POSITIVE LOGITS
anism
2.83
osaurus
2.80
Rousse
2.77
anarchism
2.76
agine
2.72
AME
2.70
AIDS
2.67
celona
2.66
Socrates
2.65
arian
2.64
Activations Density 0.000%