INDEX
Explanations
Gemma team, widely available
New Auto-Interp
Negative Logits
rem
0.67
Ratings
0.64
রহিম
0.60
yaad
0.60
र्जर
0.59
ફિલ્
0.59
աբ
0.57
Bewertungen
0.57
Ната
0.57
heim
0.56
POSITIVE LOGITS
Optimal
0.65
柞
0.62
Optimal
0.61
侏
0.59
deterred
0.58
occlusion
0.58
মাল
0.58
dipping
0.57
黑暗
0.57
锁定
0.57
Activations Density 0.096%