INDEX
Explanations
Google AI models Gemini Gemma
New Auto-Interp
Negative Logits
t
0.64
as
0.51
ع
0.50
v
0.49
ت
0.47
a
0.45
ن
0.45
никова
0.44
Partizan
0.44
Kwiat
0.44
POSITIVE LOGITS
doua
0.48
ﻒ
0.48
FORD
0.48
ardier
0.47
кноп
0.46
humming
0.45
svc
0.44
ethanol
0.44
класу
0.44
മനു
0.43
Activations Density 0.104%