INDEX
Explanations
terms related to drinking or beverages
New Auto-Interp
Negative Logits
AutoModerator
-0.48
propOrder
-0.42
Ehrungen
-0.40
Viitteet
-0.40
ValueGeneration
-0.38
فن
-0.38
-0.37
сад
-0.36
Jacinto
-0.36
هن
-0.36
POSITIVE LOGITS
drinking
1.09
drinking
1.02
Drinking
1.00
Drinking
0.98
0.89
0.80
printf
0.80
0.79
DRINK
0.74
Download
0.73
Activations Density 0.073%