Explanations
instances of objects 'coming in'
mentions of the word "in."
Negative Logits
Token    Logit
tein     -0.65
aldi     -0.62
Grab     -0.60
••       -0.60
cas      -0.60
IAS      -0.59
DOM      -0.59
ilus     -0.58
rede     -0.58
FUL      -0.57
Positive Logits
Token        Logit
handy        1.13
escap        0.84
between      0.76
accordance   0.75
offensive    0.73
somew        0.73
increments   0.71
versely      0.70
lieu         0.69
abwe         0.69
Activations Density: 0.057%
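
The density figure is commonly read as the fraction of sampled token positions on which the feature fires. The sketch below illustrates that reading only. The function name `activation_density`, the zero threshold, and the toy data are all assumptions made for illustration, not the dashboard's actual computation or data.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled token positions
    with a nonzero activation; the dashboard may define it differently.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 57 active positions out of 100,000 sampled tokens.
sample = [1.0] * 57 + [0.0] * 99_943
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.057% corresponds to roughly 57 active positions per 100,000 tokens, which is a sparse feature.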