INDEX
Explanations
expressions of gratitude and Thanksgiving experiences
New Auto-Interp
Negative Logits
isko
-0.15
ozem
-0.15
January
-0.15
Saturdays
-0.15
February
-0.14
Summer
-0.14
JAN
-0.14
lero
-0.14
åĴ
-0.14
nte
-0.14
POSITIVE LOGITS
turkey
0.55
Thanksgiving
0.52
Turkey
0.49
Thank
0.46
thank
0.46
Turkey
0.43
thankful
0.43
tur
0.42
tur
0.41
Thank
0.41
Activations Density 0.080%