INDEX
Explanations
references to candy and food-related items
New Auto-Interp
Negative Logits
بوابة
-0.58
:✨
-0.57
thâu
-0.57
tableFuture
-0.53
domés
-0.53
ویکیپدی
-0.53
PeEnEo
-0.50
betweenstory
-0.49
jspx
-0.49
SourceChecksum
-0.48
POSITIVE LOGITS
candy
0.98
candies
0.94
candy
0.77
gummy
0.77
chewy
0.76
gummies
0.76
treats
0.73
Candy
0.69
Hari
0.69
snack
0.69
Activations Density 0.216%