INDEX
Explanations
expressions of gratitude or thanks
New Auto-Interp
Negative Logits
ince
-0.15
elsey
-0.14
ż
-0.14
aley
-0.14
apr
-0.14
alth
-0.13
ardo
-0.13
SSIP
-0.13
cert
-0.13
oval
-0.13
POSITIVE LOGITS
so
0.30
again
0.25
very
0.24
bunch
0.24
much
0.21
everyone
0.21
heaps
0.20
beaucoup
0.18
again
0.18
tons
0.18
Activations Density 0.012%