Hoarseness is a common symptom reported to generalist healthcare providers, with approximately 1% of the clinical population being affected by it each year. It can be caused by multiple etiologies, such as hoarseness due to a cold, acid reflux, or laryngeal cancer. Perceptual evaluation of the voice is inaccurate, and it is therefore difficult to differentiate between hoarseness requiring urgent referral for specialty evaluation (i.e. laryngeal cancer) versus a disorder that could be managed without specialty care (i.e. acute laryngitis). The current gold standard of diagnosis for hoarseness is laryngoscopy, an in-clinic endoscopy recording of the larynx performed by an otolaryngologist specialist. Our research team seeks to improve perceptual voice evaluation by developing and testing machine learning algorithms which analyze voice for underlying pathology, beginning with an algorithm which screens voice for laryngeal masses. We hypothesize that our algorithm will have greater than 80% sensitivity and specificity in the classification of voice samples from patients with laryngeal masses. To test this, we are developing a large, prospective database of voice samples from a laryngology clinic using a smartphone application. Subjects are adult patients presenting to the laryngology clinic, with and without voice disorders, who have had a recent laryngoscopy exam and no laryngeal surgery within the past three months. We are collecting patient history which could influence voice quality, such as age, gender, alcohol use, smoking history, and subject-perceived voice disorder impact. After collection of the voice sample and patient history, cases are classified into underlying pathologic categories. We see recruitment of a well-classified and prospective patient population with a range of voice disorders. This work could lead to improved screening of patients with hoarseness in underserved and primary care settings, and more appropriate and timelier specialist referrals and treatment.