INDEX
Explanations
phrases enclosed in square brackets
quoted speech or dialogue within the text
Negative Logits
Gors       -0.67
Finals     -0.66
reckoning  -0.66
Salem      -0.64
Phoenix    -0.64
Luck       -0.63
Chennai    -0.63
Britann    -0.62
cancell    -0.61
SAM        -0.61
Positive Logits
…]         1.61
...]       1.51
english    1.12
REDACTED   0.96
']         0.93
ederal     0.79
!]         0.78
entimes    0.78
:]         0.78
ideshow    0.78
Activations Density 0.032%