INDEX
Explanations
quotations enclosed in double quotation marks
punctuations and phrases indicating excitement or urgency
New Auto-Interp
Negative Logits
949
-0.74
riott
-0.73
ãĥª
-0.73
arl
-0.71
oint
-0.70
arian
-0.69
Sil
-0.68
urrent
-0.68
arrell
-0.68
ully
-0.67
POSITIVE LOGITS
Go
1.93
Go
1.80
go
1.79
GO
1.70
go
1.56
GO
1.47
Goo
1.33
gone
1.24
Goes
1.17
went
1.12
Activations Density 0.155%