Explanations
the phrase "what exactly" indicating a request for clarification or detail
questions asking for a specific explanation or detail
Negative Logits
Token          Logit
ĸļ             -0.76
NYSE           -0.67
puter          -0.67
gra            -0.63
newsletters    -0.62
WAYS           -0.61
sha            -0.60
gi             -0.59
«ĺ             -0.59
1965           -0.58
Positive Logits
Token          Logit
transpired     1.05
separates      0.97
distinguishes  0.94
soever         0.88
happened       0.84
differs        0.84
entails        0.84
happens        0.81
constitutes    0.81
awaits         0.79
Activations Density 0.036%