INDEX
Explanations
comparisons between different levels of ability or quality
phrases related to proficiency and comparisons of skill levels
New Auto-Interp
Negative Logits
ancies
-0.76
hea
-0.71
risome
-0.70
iann
-0.69
ategory
-0.69
fty
-0.69
NetMessage
-0.65
ums
-0.65
anism
-0.65
oya
-0.65
POSITIVE LOGITS
behaved
1.03
acquainted
0.94
negotiators
0.93
buddies
0.88
negotiator
0.87
suited
0.87
performers
0.83
friends
0.83
enough
0.81
defensively
0.80
Activations Density 0.113%