CONSD is a dataset annotated with Ont4RE, an improved distantly supervised strategy, for entity-property relation extraction in the construction industry.
For more details about Ont4RE, see the companion repo Ontology-for-Relation-Extraction-Ont4RE.
- corpus.txt is the file containing the sentence pool;
- corpus_chinese_word_segmentation.txt is the file containing the Chinese word-segmented sentence pool;
- CEMO_triples.txt is the file containing the ontological classes;
- \CONSD contains the sentences annotated using Ont4RE;
- \CONSD_rule contains the sentences annotated using the traditional distantly supervised strategy.
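As a rough sketch of how the sentence pool files described above might be read, the Python snippet below loads a text file into a list of sentences and splits segmented sentences into tokens. The file layout is an assumption, not something this README states: the sketch assumes UTF-8 text with one sentence per line and, for the segmented file, whitespace between words. The helper names `load_sentences` and `load_segmented` are hypothetical.

```python
from pathlib import Path


def load_sentences(path):
    """Return the non-empty lines of a UTF-8 text file, stripped of whitespace.

    Assumes one sentence per line (an assumption about the file layout).
    """
    text = Path(path).read_text(encoding="utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]


def load_segmented(path):
    """Split each non-empty line into tokens on whitespace.

    Assumes segmented words are separated by spaces (also an assumption).
    """
    return [sentence.split() for sentence in load_sentences(path)]
```

Under these assumptions, `load_sentences("corpus.txt")` would return the raw sentence pool and `load_segmented("corpus_chinese_word_segmentation.txt")` would return one token list per sentence. If the actual files use a different delimiter or encoding, the helpers need adjusting.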
If you find the CONSD dataset helpful for your research, please consider giving this repo a star and citing our paper:
Junjie Jiang, Chengke Wu, Wenjie Sun, Yong He, Yuanjun Guo, Yang Su, Zhile Yang. Ontology-based distant supervision for extracting entity-property relations in construction documents.
For any questions, please contact junj.chiang1102@gmail.com.
