INDEX
Explanations
terms related to measurements and statistical comparisons
New Auto-Interp
Negative Logits
theless
-0.88
giaan
-0.77
Decken
-0.73
̸
-0.69
pédie
-0.68
Elise
-0.67
ilever
-0.65
DISTR
-0.64
helves
-0.62
NDE
-0.62
POSITIVE LOGITS
shots
1.63
shot
1.58
shoots
1.52
Shots
1.51
Shots
1.51
SHOT
1.46
Shot
1.43
shot
1.39
shots
1.38
shoot
1.37
Activations Density 0.073%