HWP (Hangeul Word Processor) is a word processor developed by Hancom and is one of South Korea's flagship office applications.
It uses the .hwp file extension and is widely used in businesses, schools, government institutions, and elsewhere.
Therefore, if you're a developer in South Korea, you've likely had (or will have) experience dealing with .hwp documents.
Unfortunately, the .hwp format is not yet integrated with LangChain, so we'll need to use a custom-implemented HWPLoader with langchain-teddynote and langchain-opentutorial.
In this tutorial, we'll implement an HWPLoader that can load .hwp files and extract text from them.
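To give a feel for what extracting text from a .hwp file can involve, here is a minimal sketch rather than the loader used in this tutorial. It assumes the HWP 5.0 layout, in which each body section is a stream of records (a 32-bit little-endian header holding a 10-bit tag, a 10-bit level, and a 12-bit size) and paragraph text is stored as UTF-16LE in records with tag 67. The names `Document`, `iter_records`, and `extract_text` are hypothetical stand-ins chosen for illustration. Reading the section streams out of the OLE container (for example with a library such as olefile) and handling inline control characters properly are both left out.

```python
import struct
import zlib
from dataclasses import dataclass, field

# Assumption: HWPTAG_PARA_TEXT = HWPTAG_BEGIN (16) + 51 in the HWP 5.0 layout.
HWPTAG_PARA_TEXT = 67


@dataclass
class Document:
    """Hypothetical stand-in for a LangChain-style document: text plus metadata."""
    page_content: str
    metadata: dict = field(default_factory=dict)


def iter_records(stream: bytes):
    """Yield (tag_id, payload) pairs from a decompressed section stream."""
    pos = 0
    while pos + 4 <= len(stream):
        (header,) = struct.unpack_from("<I", stream, pos)
        pos += 4
        tag_id = header & 0x3FF        # low 10 bits: record tag
        size = (header >> 20) & 0xFFF  # high 12 bits: payload size
        if size == 0xFFF:              # large payload: real size follows as a DWORD
            if pos + 4 > len(stream):
                break                  # truncated stream, stop quietly
            (size,) = struct.unpack_from("<I", stream, pos)
            pos += 4
        yield tag_id, stream[pos:pos + size]
        pos += size


def extract_text(section: bytes, compressed: bool = True) -> str:
    """Collect paragraph text from one section stream, one line per paragraph."""
    if compressed:
        section = zlib.decompress(section, -15)  # raw deflate, no zlib header
    paragraphs = []
    for tag_id, payload in iter_records(section):
        if tag_id == HWPTAG_PARA_TEXT:
            text = payload.decode("utf-16-le", errors="ignore")
            # Naive simplification: drop control characters instead of
            # interpreting HWP's inline and extended controls.
            paragraphs.append("".join(ch for ch in text if ch >= " "))
    return "\n".join(paragraphs)
```

A loader built on this idea would call `extract_text` once per section stream, join the results, and wrap them in a single `Document` whose metadata records the source path, which is why a loaded file yields a list with one element.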
Regulations on the Establishment and Operation of the National Artificial Intelligence Committee[Effective August 6, 2024] [Presidential Decree No. 34787, Enacted August 6, 2024]Regulations on the Establishment and Operation of the National Artificial Intelligence Committee Ministry of Government Legislation- / - National Statutory Information Center Reason for Enactment [Enactment]◇ Purpose To establish the National Artificial Intelligence Committee under the President to strengthen national competitiveness, protect national interests, and improve the quality of life for citizens by promoting the artificial intelligence industry and creating a trustworthy AI usage environment.◇ Main Contents A. Establishment and Functions of the National AI Committee (Article 2) 1) The National AI Committee shall be established under the President to efficiently deliberate and coordinate major policies for promoting the AI industry and establishing a foundation of trust in AI. 2) The Commit
len(docs) # Check the number of documents
1
print(docs[0].metadata) # Information about the document