INDEX
Explanations
numeric ranges
numerical values and ranges
New Auto-Interp
Negative Logits
utenberg
-0.69
Flavoring
-0.68
ota
-0.67
park
-0.67
awar
-0.65
Parish
-0.65
aceous
-0.64
Ĥª
-0.62
Alic
-0.62
Parker
-0.60
POSITIVE LOGITS
31
1.26
33
1.15
34
1.14
32
1.13
35
1.12
31
1.12
30
1.11
36
1.09
37
1.08
38
1.07
Activations Density 0.077%