Explanations
references to grills and grilling
references to various types of grills and grilling activities
Negative Logits
Token           Logit
patient         -0.77
enforcement     -0.68
abuse           -0.67
rians           -0.63
Rockefeller     -0.62
-)              -0.62
mater           -0.60
orough          -0.60
âĸĪâĸĪ          -0.59
Surveillance    -0.59
Positive Logits
Token           Logit
grill           1.19
Grill           1.06
grilled         0.91
iffin           0.80
becue           0.79
rification      0.79
oven            0.78
pie             0.78
wich            0.78
seasoning       0.76
Activations Density: 0.014%