INDEX
Explanations
references to martial arts styles and ranks
New Auto-Interp
Negative Logits
radio
-0.14
zia
-0.14
Fle
-0.14
Schl
-0.14
gradient
-0.14
Gradient
-0.13
pap
-0.13
ppe
-0.13
opaque
-0.13
Ange
-0.13
POSITIVE LOGITS
training
0.22
martial
0.21
-training
0.21
Training
0.20
Training
0.20
training
0.20
Martial
0.19
Instructor
0.18
dojo
0.18
instructors
0.18
Activations Density 0.090%