INDEX
Explanations
phrases related to research findings or data analysis
references to research findings or outcomes
New Auto-Interp
Negative Logits
gger
-0.72
queue
-0.67
[_
-0.63
timer
-0.63
ifle
-0.63
besides
-0.61
erson
-0.60
icker
-0.59
forbid
-0.59
let
-0.58
POSITIVE LOGITS
findings
3.46
conclusions
1.99
discoveries
1.91
results
1.78
observations
1.66
recommendations
1.60
results
1.52
insights
1.44
hypotheses
1.42
analyses
1.42
Activations Density 0.010%