INDEX
Explanations
weights or measurements in kilograms
references to measurements of weight in kilograms
New Auto-Interp
Negative Logits
gotten
-0.81
Dee
-0.62
irie
-0.62
zig
-0.60
structed
-0.60
Maker
-0.60
Meier
-0.60
Explain
-0.60
bidden
-0.59
Fault
-0.59
POSITIVE LOGITS
atsu
0.86
gorilla
0.81
kg
0.80
omez
0.79
ammonia
0.76
rieg
0.75
kg
0.75
ross
0.73
emouth
0.73
eter
0.73
Activations Density 0.010%