INDEX
Explanations
occurrences of the word "drop" and its variations in various contexts
Negative Logits
Token           Logit
anthene         -0.72
usitis          -0.67
alluminio       -0.66
]';             -0.66
stringBuilder   -0.64
erus            -0.64
怎样            -0.64
contextLoads    -0.63
חיצוניים        -0.61
geladeira       -0.61
Positive Logits
Token           Logit
drop            1.70
drop            1.65
drops           1.65
DROP            1.65
drops           1.54
Drops           1.53
Drop            1.50
Drop            1.50
DROP            1.41
DRO             1.39
Activations Density: 0.033%