INDEX
Explanations
keywords followed by a positive or negative adjective, focusing on evaluation
instances where the term "Such" and its contextual relevance are highlighted
New Auto-Interp
Negative Logits
folks
-0.75
ladies
-0.63
guys
-0.62
kids
-0.61
basics
-0.58
reset
-0.58
ladder
-0.57
everyone
-0.57
countdown
-0.56
version
-0.56
POSITIVE LOGITS
Such
3.09
Such
2.85
such
1.87
such
1.67
These
1.28
Particularly
1.27
Thus
1.20
Those
1.18
Similarly
1.17
Moreover
1.16
Activations Density 0.013%