Healthy Algorithms
Skip to content
  • Home
  • About
← CT4GH: Algorithmic Thinking
Journal Club: Ten Frequently Asked Questions About Implicit Measures and Their Frequently Supposed, But Not Entirely Correct Answers →
February 15, 2017 · 8:00 am

Websearch for Text Clustering

Here are some interesting results:

http://datascience.stackexchange.com/questions/979/algorithms-for-text-clustering

Click to access text-cluster.pdf

Anything I should add to my reading list?

Share this:

  • Click to share on Facebook (Opens in new window) Facebook
  • Click to share on X (Opens in new window) X
Like Loading...

Related

3 Comments

Filed under Uncategorized

← CT4GH: Algorithmic Thinking
Journal Club: Ten Frequently Asked Questions About Implicit Measures and Their Frequently Supposed, But Not Entirely Correct Answers →

3 responses to “Websearch for Text Clustering”

  1. Steve T
    February 15, 2017 at 9:00 pm

    Maybe check into word/sentence/paragraph/document embeddings as a precursor for clustering.

  2. Abraham Flaxman
    February 15, 2017 at 10:57 pm

    Like word2vec? https://www.tensorflow.org/tutorials/word2vec

  3. Steve T
    February 15, 2017 at 11:03 pm

    Yep, that’s a good place to start. I think the latest thing is to be able to summarize larger chunks of text with some sort of attention model.

  • Posts

    • New year’s eve edition of JPC
    • AI for Interview Prep
    • AI in Epi 554 (parts 4 and 5 of 5)
  • aco ai ai4hm algorithms baby animals Bayesian books conference contest costs dataviz data viz disease modeling dismod diversity diversity club free/open source funding gaussian processes gbd global health health inequality health metrics health records idv IDV4GH ihme infoviz ipython iraq journal club machine learning malaria matching algorithms matchings MCMC media microsimulation mortality mpld3 my research Mysteries networks networkx optimization orms pandas privacy probability public health pymc pymc3 python random effects reading list reproducible research reproductive health research jobs seminar sklearn software carpentry spanning trees sparql statistics stats survey talks TCS teaching Theory Blogs travel tutorial va verbal autopsy vital registration
  • Theory Blogs

    • 0xDE (David Eppstein)
    • Computational Complexity (Lance Fortnow/Bill Gasarch)
    • In Theory (Luca Trevisan)
    • Luis von Blog (Luis von Ahn)
    • Machine Learning (Theory) (John Langford & others)
    • Micael Trick’s Operations Research Blog
    • My Slice of Pizza (Muthu)
    • Punk Rock Operations Research (Laura McLay)
    • Shtetl-Optimized (Scott Aaronson)
    • Structure and Strangeness (Aaron Clauset)
  • some rights reserved

    This material is released under the Creative Commons Noncommercial Attribution Share-Alike 3.0 License
  • Pages

    • About
  • February 2017
    M T W T F S S
     12345
    6789101112
    13141516171819
    20212223242526
    2728  
    « Jan   Mar »
  • Archives

  • Meta

    • Create account
    • Log in
    • Entries feed
    • Comments feed
    • WordPress.com
Healthy Algorithms · A blog about algorithms, combinatorics, and optimization applications in global health informatics.
Blog at WordPress.com.
  • Reblog
  • Subscribe Subscribed
    • Healthy Algorithms
    • Join 181 other subscribers
    • Already have a WordPress.com account? Log in now.
    • Healthy Algorithms
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Copy shortlink
    • Report this content
    • View post in Reader
    • Manage subscriptions
    • Collapse this bar
%d