INDEX
Explanations
phrases related to governmental actions and policies
concepts related to choices and their implications
New Auto-Interp
Negative Logits
20439
-0.80
ï¸ı
-0.80
depth
-0.75
":["
-0.69
frequ
-0.68
particularly
-0.68
often
-0.68
reputable
-0.67
Often
-0.67
estones
-0.67
POSITIVE LOGITS
Frankenstein
1.08
capit
0.85
reinvent
0.82
Trojan
0.81
equivalent
0.81
shorthand
0.80
undo
0.79
legalized
0.78
rewriting
0.78
inverse
0.77
Activations Density 0.406%