Saturday, March 4, 2023

Show HN: gpt-graph. A simple, GPT-3 text to entity-relation graph generator https://ift.tt/PX1TVj7

Show HN: gpt-graph. A simple, GPT-3 text to entity-relation graph generator Hi HN! This is a simple text to entity-relation graph generator, powered by gpt-3 davinci model. The purpose is to feed it actual written data, to obtain a graph representation of entities and relationships mentioned in the text. Also, being able to identify entity attributes like gender, size, age ... My initial goal, was to make it able to process a large amount of text into a big single graph. The problem being the 4000 token limit the model has, I decided to take the approach of feeding the text in batches, and try to merge the incoming graph with the existing information each time. This is done by comparing the incoming node labels with those already in the graph, adding the new information to the existing nodes. This works somewhat, but sometimes entities get duplicated if they are mentioned slightly differently in the text. The comparation method could use some improvement clearly. A nice feature, is that you get to decide what types you want to extract. So if, for example, you are interested only in people, and companies in the text, you can tell the model to stick to that. You can also leave the types to the model discretion. Also, the application allows for saving / loading graphs to json files. These files can be used with Cytoscape Desktop Application, which is a nice side effect of using cytoscape.js. in the UI. I think tools like this can really be of help when going through dense documentation. To have a visual representation of the concepts, entities or whatever, can be really helpful in education, investigation, legal ... Would love to hear your thoughts on how this could be improved. https://ift.tt/RuBXj7d March 5, 2023 at 05:17AM

No comments:

Post a Comment