INDEX
Explanations
terms with the suffix '-ista', which could be related to occupations or ideologies
terms related to various types of specialists or experts
New Auto-Interp
Negative Logits
Fish
-0.68
glers
-0.66
MENTS
-0.65
Norse
-0.65
brother
-0.63
lying
-0.63
sub
-0.63
eric
-0.63
tra
-0.62
relations
-0.62
POSITIVE LOGITS
ista
1.33
istas
1.22
terday
0.83
llan
0.79
ignt
0.78
ea
0.77
uto
0.75
Libre
0.75
yip
0.74
uthor
0.74
Activations Density 0.010%