INDEX
Explanations
words indicating accuracy, correctness, or truth
terms related to accuracy or correctness
New Auto-Interp
Negative Logits
venture
-0.71
wings
-0.69
unia
-0.69
ventures
-0.67
raints
-0.67
ury
-0.62
Throne
-0.62
*/(
-0.61
queue
-0.59
bean
-0.59
POSITIVE LOGITS
accurate
3.52
inaccurate
2.38
accuracy
2.16
accurately
2.06
correct
1.76
precise
1.74
inacc
1.65
reliable
1.63
truthful
1.57
accur
1.56
Activations Density 0.022%