INDEX
Explanations
interactions involving persuasion and familial relationships
New Auto-Interp
Negative Logits
erule
-0.16
fsp
-0.15
inox
-0.15
arel
-0.15
ritel
-0.14
Pant
-0.14
wcs
-0.14
ustos
-0.14
-valu
-0.13
è³Ģ
-0.13
POSITIVE LOGITS
convince
0.43
pers
0.42
conv
0.41
persuade
0.40
convincing
0.38
convin
0.38
Conv
0.38
persu
0.36
Pers
0.36
thuyết
0.35
Activations Density 0.473%