INDEX
Explanations
understated charisma and magnetism
New Auto-Interp
Negative Logits
vomiting
0.85
新增
0.78
fantasies
0.72
hyped
0.70
狰
0.69
suicidal
0.68
glorify
0.67
cosmetic
0.67
痛苦
0.67
প্রহ
0.66
POSITIVE LOGITS
quietly
1.02
quiet
1.02
infectious
0.97
unassuming
0.97
magnetism
0.94
understated
0.92
generously
0.91
kindness
0.88
charisma
0.87
uncomplicated
0.86
Activations Density 0.379%