INDEX
Explanations
phrases that express emotional states or personal reflections
processes or actions
predicting next words in phrases
New Auto-Interp
Negative Logits
Климат
-0.55
hız
-0.52
Ỉ
-0.50
sivu
-0.48
Relations
-0.47
Ablauf
-0.46
خة
-0.45
Fazit
-0.45
frain
-0.45
'][]
-0.44
POSITIVE LOGITS
SequentialGroup
0.79
0.79
الاطلاع
0.72
ViewFeatures
0.72
AnimationsModule
0.72
]")]
0.72
tartalomajánló
0.70
TemporalType
0.70
насељу
0.68
ujednoznacz
0.68
Activations Density 0.433%