INDEX
Explanations
phrases related to the subject of discussion or focus
the phrase "subject of" in various contexts, often relating to controversy or discussion
New Auto-Interp
Negative Logits
ents
-0.72
hing
-0.69
jp
-0.65
pine
-0.64
hes
-0.63
ITH
-0.63
nce
-0.62
behaved
-0.62
whe
-0.61
chens
-0.61
POSITIVE LOGITS
ridicule
1.11
intense
0.95
controversy
0.92
scorn
0.90
fierce
0.89
contention
0.89
attention
0.87
ire
0.87
ENTION
0.86
criticism
0.84
Activations Density 0.087%