INDEX
Explanations
the letter "P" followed by a number
references to a specific variable or parameter labeled as 'P'
New Auto-Interp
Negative Logits
diplom
-0.65
Lauder
-0.64
Jama
-0.63
neoc
-0.63
orche
-0.61
Rih
-0.61
ancest
-0.60
kinderg
-0.60
Glas
-0.59
vested
-0.59
POSITIVE LOGITS
airs
1.28
adding
1.24
ixels
1.24
ivot
1.20
aired
1.18
ainted
1.15
EEK
1.15
ipes
1.14
icking
1.13
ylon
1.12
Activations Density 0.040%