INDEX
Explanations
descriptions of legal proceedings
instances of strong emotional responses or significant events
Negative Logits
Token         Logit
Additionally  -1.02
Conclusion    -0.96
Furthermore   -0.92
Firstly       -0.90
Recommend     -0.90
Secondly      -0.89
Therefore     -0.89
Whilst        -0.87
Regarding     -0.87
[/            -0.86
Positive Logits
Token         Logit
mornings      1.07
strangers     1.03
dusk          0.97
adolescence   0.96
smiles        0.93
startled      0.88
greets        0.87
twent         0.87
passers       0.84
perched       0.83
Activations Density: 0.829%