INDEX
Explanations
different preferences and situations
New Auto-Interp
Negative Logits
categorie
0.46
Satu
0.44
रेंज
0.44
범위
0.42
domen
0.42
predom
0.42
category
0.41
dimensions
0.40
カテゴ
0.40
категория
0.40
POSITIVE LOGITS
preferences
0.69
situations
0.66
budgets
0.65
preferences
0.65
Preferences
0.64
Preferences
0.63
préférences
0.58
situaciones
0.57
situações
0.57
situation
0.57
Activations Density 0.130%