INDEX
Explanations
individuality, variation, no single answer
New Auto-Interp
Negative Logits
questionable
0.42
enhanced
0.39
lingering
0.38
FAV
0.38
feasible
0.37
нням
0.37
dubious
0.37
refin
0.37
ेयर
0.37
extended
0.36
POSITIVE LOGITS
Different
1.30
different
1.24
Different
1.21
different
1.18
Each
1.09
不同的
1.08
Each
1.07
each
1.06
diferentes
1.05
each
1.04
Activations Density 0.023%