INDEX
Explanations
statements or facts that end with punctuation marks like periods or commas
sentences that indicate a significant event or statement
New Auto-Interp
Negative Logits
brill
-0.78
honoured
-0.75
welcome
-0.74
favourite
-0.72
organise
-0.71
endeavour
-0.71
elbow
-0.70
isable
-0.70
handshake
-0.69
rubbish
-0.68
POSITIVE LOGITS
âĢ
1.99
»
1.41
âĹı
1.40
âĢ
1.39
ãĢ
1.38
âĢł
1.35
1.35
**
1.33
âϦ
1.26
âĸł
1.26
Activations Density 0.612%