当前位置:首页 > 报告详情

利用人工智能改进数据发现——挪威国家数据门户.pdf

上传人: 新** 编号:937398 2025-10-15 17页 813.10KB

1、Norways National Data Portal improving data discovery usingAI Data.norge.no is the national data portal of NorwayThe database consists of RDF-descriptions compatible to specifications and standards(i.e.dcat-ap-no)In this presentation I will demonstrate how and why we turned to using Artificial intel

2、ligence(AI)and Large Language Models(LLMs)Introduction Datasets are often described using a field specific jargon.Users looking for datasets did not always know the terms used in the descriptions of the datasets they were interested in.Users did not know what data they needed but often knew what the

3、y wanted to use the data for.Background what were the user needs?What alternatives did we consider?Make the data owners improve their descriptionsTune relevance of elastic search engineImprove documentation and information about the searchExperiment with Artificial intelligence We wanted more experi

4、ence with AI The Minister of Digitalisation and Public Governance in Norway expressed that she wanted more of the public sector to use AI.LLM technology works well to roughly navigate and extract information from large text hubs using natural language.Short way from idea to market.Possible to alter

5、as we gathered more insight and knowledgeWhy AI?Why LLM search?Worries before we startedCostQualityLack of AI-competence Security and privacy Cross sector cooperation and initiative We had knowledge about our database.We had well structured metadata Others had time and resources to experiment with A

6、I.What made it possible to start experimenting?Elapsed time:3 weeks.Intentionally made with the same tech stack we already use.Cost:170 EUR/month of which 90%is cost related to Postgres database.Limitations:Based on one single extraction from the dataset catalogue.No dynamic update of data.The proto

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据报告的内容,以下是全文主要内容的简明概括: 1. **挪威国家数据门户(data.norge.no)**:该门户使用人工智能(AI)和大型语言模型(LLM)来改善数据发现。 2. **背景**:用户在寻找数据集时,往往不知道描述中的术语,也不知道所需数据的具体用途。 3. **解决方案**:采用AI和LLM技术,通过自然语言处理来提取和导航大量文本信息。 4. **实施过程**:从原型开发到生产部署历时3个月,成本约170欧元/月。 5. **原型测试**:用户测试和内部反馈均积极,成本低于预期。 6. **生产部署**:2024年6月,AI搜索作为国家数据门户的一部分投入生产。 7. **工作原理**:使用Postgres数据库和Vertex AI进行数据向量化,LangChain创建提示以过滤不相关数据。 8. **优点**:易于替换LLM,成本低,可控制数据共享,开源。 9. **缺点**:LLM依赖性,未训练所有术语,文本长度限制。 10. **成本**:每月总成本约25欧元,处理查询约2000个/月。 11. **环境影响**:通过减少硬件需求和使用,节省能源。 12. **结论**:鼓励使用AI,注意安全和隐私,持续开发AI解决方案。
挪威数据门户的革新之路" 挪威数据发现新体验" 挪威数据门户AI搜索揭秘"
客服
商务合作
小程序
服务号
折叠