Explanations
terms related to concepts or abstract ideas
Negative Logits
Token          Logit
GEBURTSDATUM   -0.68
PerformLayout  -0.65
Notae          -0.65
Jeografia      -0.61
firstly        -0.60
Revenir        -0.59
oprot          -0.59
انجليز         -0.58
featureID      -0.58
hens           -0.58
Positive Logits
Token          Logit
Idea           1.23
idea           1.15
Ideas          1.15
Idea           1.14
Ideas          1.09
ideas          1.06
ideas          0.93
IDEA           0.93
Ide            0.91
IDEA           0.91
Activations Density 0.119%