Explanations
numerical values and codes
numerical data and statistics
Negative Logits
nomine      -0.96
annis       -0.86
contrace    -0.79
corrid      -0.78
neighb      -0.74
referen     -0.73
hemor       -0.71
misunder    -0.70
interf      -0.69
ModLoader   -0.69
Positive Logits
partName    0.93
df          0.81
attRot      0.77
370         0.73
dict        0.72
238         0.71
649         0.70
698         0.70
çļĦ         0.70
fc          0.68
Activations Density 0.122%
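
The positive-logit token "çļĦ" looks like mojibake, but it may simply be the vocabulary string of a byte-level BPE token. The page does not name the tokenizer, so the following is only a sketch under the assumption that it uses the GPT-2-style byte-to-character mapping; the helper names `gpt2_byte_decoder` and `decode_token` are hypothetical and chosen for illustration.

```python
def gpt2_byte_decoder():
    """Build the character -> byte table used by GPT-2-style byte-level BPE.

    Assumption: the dashboard's tokenizer follows this scheme. Printable
    bytes map to the character with the same code point; every other byte
    is shifted to code point 256 and above, in ascending byte order.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAD))
        + list(range(0xAE, 0x100))
    )
    table = {}
    shift = 0
    for byte in range(256):
        if byte in printable:
            table[chr(byte)] = byte
        else:
            table[chr(256 + shift)] = byte
            shift += 1
    return table


def decode_token(token):
    """Convert a vocabulary string back to readable text (hypothetical helper)."""
    table = gpt2_byte_decoder()
    raw = bytes(table[ch] for ch in token)
    # Tokens can end in the middle of a UTF-8 sequence, so replace bad bytes
    # instead of raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["çļĦ", "df", "partName"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, "çļĦ" corresponds to the bytes E7 9A 84, which is the UTF-8 encoding of the Chinese character 的; plain ASCII tokens such as "df" decode to themselves.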