INDEX
Explanations
mentions of a specific individual's name, likely related to the name "Uler" or variations thereof
New Auto-Interp
Negative Logits
AxisAlignment
-0.68
engineer
-0.63
юра
-0.61
teacher
-0.60
Engineer
-0.59
accueillir
-0.58
waitress
-0.57
engineers
-0.57
MINISTER
-0.57
Engineer
-0.56
POSITIVE LOGITS
orer
1.20
uler
1.09
itzer
1.01
pler
0.91
iler
0.90
acher
0.88
asser
0.88
oner
0.88
ammer
0.87
ailer
0.87
Activations Density 0.106%