INDEX
Explanations
categorical labels and structured data formats
category: and source:
New Auto-Interp
Negative Logits
ftagPool
-0.81
חיצוניים
-0.61
zijne
-0.57
Geſch
-0.56
нгред
-0.54
exitRule
-0.53
بيها
-0.53
Vidite
-0.52
dieſer
-0.52
Winaray
-0.51
POSITIVE LOGITS
:
0.47
Type
0.44
type
0.40
ISNI
0.40
KURZBESCHREIBUNG
0.38
ante
0.36
type
0.36
Brandt
0.36
Vue
0.36
/
0.36
Activations Density 0.062%