INDEX
Explanations
instances of significant findings or impacts in research
New Auto-Interp
Negative Logits
šinou
-0.68
なんでも
-0.62
Filmo
-0.59
来看看
-0.58
éd
-0.57
acostumb
-0.57
wzor
-0.56
صوتيه
-0.55
obligé
-0.53
refroid
-0.53
POSITIVE LOGITS
significant
4.22
significant
3.87
Significant
3.82
Significant
3.82
SIGNIFIC
3.16
significativa
2.69
significativo
2.66
signific
2.54
substantial
2.52
significativos
2.46
Activations Density 0.117%