INDEX
Explanations
phrases related to technology and miscellaneous information
expressions and terminology related to drug use and its consequences
New Auto-Interp
Negative Logits
Reform
-0.61
UTH
-0.61
Grimoire
-0.58
Alter
-0.57
Teacher
-0.56
Disorder
-0.55
Advance
-0.55
Scholars
-0.54
HuffPost
-0.54
Guild
-0.54
POSITIVE LOGITS
preferably
0.79
mustard
0.66
thirsty
0.66
onics
0.64
sure
0.62
kidding
0.61
inki
0.61
sod
0.60
osuke
0.60
onna
0.59
Activations Density 0.746%