lamiastella

underfull \hbox (badness 1097) in paragraph

Aug 6th, 2020
underfull \hbox (badness 1097) in paragraph in lines 190--194

*copied lines 188 to 194 (including 194)

\paragraph{Data Cleaning and Pre-processing for LDA}
% TODO_7: Alyssa: flesh this out
While there is some flexibility regarding the format of the dataset (the system currently supports .xls, .xlsx, and .csv), the dataset must contain at least a column labeled ``Example''. This column holds the text examples, with one document (or, more broadly, one textual entity) per row.
% DT: These preprocessing steps are actually baked in. The user has no control.
%When the user uploads their spreadsheet, they are able to select the preprocessing that is appropriate for their dataset. The user is additionally able to specify parameters for some of the preprocessing steps, such as common phrase removal and multi-word entity joining. The system does all of the preprocessing outlined below by default, but the user can toggle specific processes on and off according to their needs.
LDA employs a bag-of-words model, in which each document is treated as an unordered collection of words. To make the analysis more conducive to discovering useful topics, the system filters out extremely common and extremely rare words. The pre-processing steps we employ include the following:
\begin{itemize}