INDEX
Explanations
major, important, significant
Negative Logits
situación      0.79
silly          0.77
thing          0.76
fancy          0.72
superstition   0.71
something      0.70
situation      0.70
shenanigans    0.70
cosas          0.69
ridiculous     0.68
Positive Logits
Important      1.07
重要な          1.04
mainstay       1.03
的重要          0.98
contributor    0.97
важли          0.96
important      0.95
গুরুত্বপূর্ণ     0.94
important      0.94
member         0.94
Activations Density 0.618%