INDEX
Explanations
specific references to something being the subject of discussion or analysis
references to a specific topic or concept that is being emphasized
New Auto-Interp
Negative Logits
amp
-0.72
esm
-0.69
owed
-0.68
adle
-0.68
ortment
-0.66
rollers
-0.66
wives
-0.66
ahs
-0.65
Band
-0.65
Scores
-0.64
POSITIVE LOGITS
particular
1.11
trope
1.01
phenomenon
1.01
predicament
0.97
newfound
0.97
discrepancy
0.95
arrangement
0.91
arrang
0.90
outcome
0.86
omission
0.85
Activations Density 0.215%