INDEX
Explanations
phrases expressing strong emotions or emphasis
repetitions of the word "so."
Negative Logits
{:            -0.65
lihood        -0.63
expectancy    -0.62
actionDate    -0.61
path          -0.60
backdrop      -0.59
arrival       -0.59
hurst         -0.59
passage       -0.59
Peninsula     -0.57
Positive Logits
oooo                1.11
bered               1.11
ooo                 1.10
oner                1.03
oths                1.01
othe                1.01
oooooooooooooooo    0.97
arin                0.97
zin                 0.95
oooooooo            0.92
Activations Density 0.113%