Electronic health records (EHRs) are often used by clinical and data researchers in numerous ways for various scientific investigations. When sharing patient information, certain precautions must be followed as to prevent the risk of a malicious actor being able to extract sensitive information. This study examines an experimental method of removing potentially identifiable information from free text medical notes by finding and removing phrases which are statistically uncommon. Furthermore, this study assesses if this method reduces risk of identification while also maintaining the utility of the data. The method involves analyzing a free-text dataset by first breaking all text up into fixed length phrases. The frequencies of these phrases are then tracked across the entire dataset on a per-patient, per-note, and dataset-wide basis. To benchmark the method, notes are de-identified using the method and privacy and utility are tested under different conditions. The results from using this method on real clinical notes are expected to produce text that will not only be more secure but will also retain information useful for applications such as machine learning, natural language processing, and data analysis. If this method proves to be successful, it could lead to institutions being able to share medical notes with researchers more easily. This in turn would eliminate a major obstacle which medical researchers face, as it would give them access to more data. Finally, when data are shared between institutions for research, the risk of identification can be represented as an objective and quantifiable metric.