{"id":854,"date":"2023-04-09T17:25:00","date_gmt":"2023-04-09T08:25:00","guid":{"rendered":"https:\/\/staka.jp\/wordpress\/?p=854"},"modified":"2023-04-09T17:25:00","modified_gmt":"2023-04-09T08:25:00","slug":"pile%e3%81%ae%e6%96%87%e5%ad%97%e6%a7%8b%e6%88%90%ef%bc%88%e3%81%aa%e3%81%9ccerebras-gpt%e3%81%a7%e6%97%a5%e6%9c%ac%e8%aa%9e%e3%81%8c%e4%bd%bf%e3%81%88%e3%82%8b%e3%81%ae%e3%81%8b%ef%bc%9f%ef%bc%89","status":"publish","type":"post","link":"https:\/\/staka.jp\/wordpress\/?p=854","title":{"rendered":"The Pile\u306e\u69cb\u6210\uff08\u306a\u305cCerebras-GPT\u3067\u65e5\u672c\u8a9e\u304c\u4f7f\u3048\u308b\u306e\u304b\uff1f\uff09"},"content":{"rendered":"\n<p>ChatGPT\u304c\u76db\u308a\u4e0a\u304c\u308b\u4e2d\u3001\u30aa\u30fc\u30d7\u30f3\u30e9\u30a4\u30bb\u30f3\u30b9\u306aLLM\uff08\u5927\u898f\u6a21\u8a00\u8a9e\u30e2\u30c7\u30eb\uff09\u958b\u767a\u3082\u884c\u308f\u308c\u3066\u3044\u308b\u3002\u305d\u306e\u4e2d\u3067Cerebras-GPT\uff08<a href=\"https:\/\/www.cerebras.net\/press-release\/cerebras-systems-releases-seven-new-gpt-models-trained-on-cs-2-wafer-scale-systems\">Cerebras Systems Releases Seven New GPT Models Trained on CS-2 Wafer-Scale Systems &#8211; Cerebras<\/a>\uff09\u304c\u65e5\u672c\u8a9e\u3092\u4f7f\u3048\u308b\u3068\u805e\u3044\u3066\u8a66\u3057\u3066\u307f\u305f\u3002<\/p>\n\n\n\n<p>Colab Pro +\u3067A100 GPU\u3092\u4f7f\u3046\u3068\u3001<a href=\"https:\/\/huggingface.co\/cerebras\/Cerebras-GPT-6.7B\">cerebras\/Cerebras-GPT-6.7B \u00b7 Hugging Face<\/a>\u306b\u305d\u3063\u30666.7B\u306e\u30e2\u30c7\u30eb\u3092\u52d5\u4f5c\u3055\u305b\u308b\u3053\u3068\u304c\u3067\u304d\u308b[1]\u3002<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>import torch\ndevice = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')\n\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\nfrom transformers import pipeline\n\ntokenizer = AutoTokenizer.from_pretrained(\"cerebras\/Cerebras-GPT-6.7B\")\nmodel = AutoModelForCausalLM.from_pretrained(\"cerebras\/Cerebras-GPT-6.7B\")\n\npipe = pipeline(\"text-generation\", model=model, tokenizer=tokenizer, device=device)\n\ntext = \"\u732b\u3068\u66ae\u3089\u3057\u306610\u5e74\u306b\u306a\u308b\u4eba\u306b\u805e\u3044\u305f\u3001\u732b\u3068\u4e00\u7dd2\u306b\u66ae\u3089\u3059\u3068\u304d\u306b\u5fc5\u8981\u306a\u3082\u306e\u7b2c1\u4f4d\u306f\"\ngenerated_text = pipe(text, max_length=500, do_sample=False, no_repeat_ngram_size=2)&#91;0]\nprint(generated_text&#91;'generated_text'])<\/code><\/pre>\n\n\n\n<p>\u4e0a\u8a18\u306e\u51fa\u529b\u306f\u300c<code>\u732b\u3068\u66ae\u3089\u3057\u306610\u5e74\u306b\u306a\u308b\u4eba\u306b\u805e\u3044\u305f\u3001\u732b\u3068\u4e00\u7dd2\u306b\u66ae\u3089\u3059\u3068\u304d\u306b\u5fc5\u8981\u306a\u3082\u306e\u7b2c1\u4f4d\u306f\u300c\u7b11\u9854\u300d\u3002\u7d9a\u3044\u3066\u300c\u604b\u611b\u300d\u3001\u300c\u81ea\u5206\u306e\u3053\u3068\u3092\u77e5\u3063\u3066\u3044\u308b\u300d\u3068\u3044\u3046\u7d50\u679c\u306b\u3002<\/code>\u300d[2]\u3001\u306a\u304a\u30012.7B\u30d1\u30e9\u30e1\u30fc\u30bf\u306e\u5834\u5408\u306f\u300c<code>\u732b\u3068\u66ae\u3089\u3057\u306610\u5e74\u306b\u306a\u308b\u4eba\u306b\u805e\u3044\u305f\u3001\u732b\u3068\u4e00\u7dd2\u306b\u66ae\u3089\u3059\u3068\u304d\u306b\u5fc5\u8981\u306a\u3082\u306e\u7b2c1\u4f4d\u306f\u3001\u305d\u308c\u305e\u308c\u306e\u4eba\u9593\u304c\u7b11\u3063\u3066\u3044\u308b\u3068\u3044\u3046\u3053\u3068\u3067\u3059\u3002<\/code>\u300d[2]\u3060\u3063\u305f\u3002<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>model = AutoModelForCausalLM.from_pretrained(\"cerebras\/Cerebras-GPT-13B\", torch_dtype=torch.float16)<\/code><\/pre>\n\n\n\n<p>\u3068\u3057\u3066Cerebras-GPT-13B\u3092\u4f7f\u3063\u305f\u6642\u306e\u7d50\u679c\u306f\u300c<code>\u732b\u3068\u66ae\u3089\u3057\u306610\u5e74\u306b\u306a\u308b\u4eba\u306b\u805e\u3044\u305f\u3001\u732b\u3068\u4e00\u7dd2\u306b\u66ae\u3089\u3059\u3068\u304d\u306b\u5fc5\u8981\u306a\u3082\u306e\u7b2c1\u4f4d\u306f\u3001\u300c\u7b11\u9854\u300d\u3002\u7d50\u679c\u3001\u305d\u308c\u306f\u79c1\u306e\u8a00\u8449\u3067\u306f\u306a\u304f\u3001\u81ea\u5206\u306e\u3082\u306e\u3067\u3042\u308b\u3002<\/code>\u300d[2]\u3060\u3063\u305f\u3002<\/p>\n\n\n\n<p>\u65e5\u672c\u8a9e\u3092\u4f7f\u3048\u308b\u3068\u8a00\u3063\u3066\u3082\u6027\u80fd\u306f\u304b\u306a\u308a\u5fae\u5999\u306a\u611f\u3058\u3002<br>\u203b\u4ee5\u964d\u3001\u7406\u7531\u3092\u63a2\u3063\u3066\u3044\u304f\u304c\u672c\u6765\u4f7f\u3048\u306a\u3044\u306f\u305a\u306e\u65e5\u672c\u8a9e\u3063\u307d\u3044\u6587\u5b57\u5217\u304c\u8868\u793a\u3055\u308c\u308b\u306e\u306f\u7d50\u69cb\u3059\u3054\u3044\u3068\u3044\u3046\u306e\u304c\u5168\u4f53\u7684\u306a\u611f\u60f3\u3002<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The Pile<\/h2>\n\n\n\n<p>Cerebras-GPT\u30b7\u30ea\u30fc\u30ba\u306f\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u3068\u3057\u3066<a href=\"https:\/\/pile.eleuther.ai\/\">The Pile<\/a>[3]\u3092\u7528\u3044\u3066\u3044\u308b\u3002\u8ad6\u6587\u306b\u66f8\u304b\u308c\u3066\u3044\u308b\u901a\u308aThe Pile\u306f\u82f1\u8a9e\u306e\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u3067\u3042\u308b\u3002\u305d\u306e\u305f\u3081\u57fa\u672c\u7684\u306b\u306f\u65e5\u672c\u8a9e\u3092\u4f7f\u3048\u306a\u3044\u306f\u305a\u306a\u306e\u3060\u304c\u3001The Pile\u3092\u7528\u3044\u3066\u5b66\u7fd2\u3055\u308c\u305fLLM\u306f\u65e5\u672c\u8a9e\u3092\u3042\u308b\u7a0b\u5ea6\u89e3\u91c8\u3057\u3066\u3044\u308b\u306e\u3067\u306f\uff1f\u3068\u601d\u3046\u52d5\u4f5c\u3092\u3059\u308b\u3002\u3053\u308c\u306fThe Pile\u69cb\u7bc9\u3067\u4f7f\u308f\u308c\u305f\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u306b\u610f\u56f3\u305b\u305a\u65e5\u672c\u8a9e\u304c\u542b\u307e\u308c\u3066\u3044\u308b\u305f\u3081\u3068\u8a00\u308f\u308c\u3066\u3044\u308b\u3002<strong>\u524d\u7f6e\u304d\u304c\u9577\u304f\u306a\u3063\u305f\u304c\u3001\u672c\u8a18\u4e8b\u3067\u306f\u3053\u308c\u304c\u672c\u5f53\u306a\u306e\u304b\u691c\u8a3c\u3059\u308b\u3002<\/strong><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">\u691c\u8a3c\u65b9\u6cd5<\/h2>\n\n\n\n<p>The Pile\u3092\u30c0\u30a6\u30f3\u30ed\u30fc\u30c9\u3057\u542b\u307e\u308c\u308b\u6587\u5b57\u3092\u6587\u5b57\u5225\u306b\u30ab\u30a6\u30f3\u30c8\u3057\u3001\u4e0b\u8a18\u6761\u4ef6\u306e\u6587\u5b57\u3092\u65e5\u672c\u8a9e\u3068\u8003\u3048\u3066\u65e5\u672c\u8a9e\u306e\u69cb\u6210\u6bd4\u7387\u3092\u7b97\u51fa\u3057\u305f\u3002CJK\u306e\u540d\u306e\u901a\u308a\u65e5\u672c\u8a9e\u3067\u306f\u306a\u3044\u7528\u4f8b\u306e\u6f22\u5b57\u3092\u542b\u3081\u3066\u6570\u3048\u3066\u304a\u308a<strong>\u660e\u3089\u304b\u306b\u65e5\u672c\u8a9e\u5206\u3092\u591a\u304f\u30ab\u30a6\u30f3\u30c8\u3057\u3066\u3044\u308b<\/strong>\u3068\u3044\u3046\u554f\u984c\u304c\u3042\u308b\u306e\u3060\u304c\u3001\u3053\u3053\u3067\u306f\u3044\u3063\u305f\u3093\u7121\u8996\u3059\u308b\u3053\u3068\u306b\u3059\u308b[4]\u3002<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Unicode\u306eBlock=Hiragana<\/li>\n\n\n\n<li>Unicode\u306eBlock=Katakana<\/li>\n\n\n\n<li>Unicode\u306eBlock=CJK&nbsp;Unified&nbsp;Ideographs<\/li>\n<\/ul>\n\n\n\n<p>The Pile\u306e\u30c7\u30fc\u30bf\u3092\u5c55\u958b\u3057\u306a\u304c\u3089\u6587\u5b57\u5225\u30ab\u30a6\u30f3\u30c8\u3057\u3001\u305d\u306e\u5f8c\u3001\u4e0a\u8a18\u6761\u4ef6\u3092regex\u3067\u30c1\u30a7\u30c3\u30af\u3057\u65e5\u672c\u8a9e\u3068\u5168\u4f53\u306e\u6587\u5b57\u6570\u3092\u96c6\u8a08\u3057\u305f\u3002Python\u3067\u9069\u5f53\u306b\u66f8\u3044\u305f\u611f\u3058c6a.xlarge\u30671\u30d7\u30ed\u30bb\u30b9\u8fba\u308a9M\u30d0\u30a4\u30c8\/\u79d2\u7a0b\u5ea6\u306e\u901f\u5ea6\u3067\u51e6\u7406\u304c\u3067\u304d\u305f\u30022\u30d7\u30ed\u30bb\u30b9\u4e26\u5217\u3055\u305b\u308b\u30681\u65e5\u4ee5\u5185\u306b\u51e6\u7406\u304c\u7d42\u308f\u308b\u30a4\u30e1\u30fc\u30b8\u3067\u3042\u308b\u3002<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">\u691c\u8a3c\u7d50\u679c<\/h2>\n\n\n\n<p>\u691c\u8a3c\u7d50\u679c\u306f\u6b21\u8868\u306e\u901a\u308a\u3002The Pile\u306b\u304a\u3051\u308b\u65e5\u672c\u8a9e\u306e\u69cb\u6210\u6bd4\u7387\u306f\u308f\u305a\u304b0.07%\uff08\u7d04900 M\u6587\u5b57\uff09\u3060\u3063\u305f\u3002\u65e5\u672c\u8a9e\u306b\u8ca2\u732e\u3057\u3066\u3044\u308b\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u306f\u300cOpenWebText2\u300d\u300cGithub\u300d\u304c\u5927\u304d\u304f\u3001\u7d9a\u3044\u3066\u300cYoutubeSubtitles\u300d\u3060\u3063\u305f\u3002<\/p>\n\n\n\n<div class=\"wp-block-group\"><div class=\"wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained\">\n<figure class=\"wp-block-table is-style-stripes\"><table><tbody><tr><td class=\"has-text-align-left\" data-align=\"left\"><strong>\u30c7\u30fc\u30bf\u540d<\/strong><\/td><td class=\"has-text-align-right\" data-align=\"right\"><strong>\u5168\u4f53\u306e\u6587\u5b57\u6570<\/strong><\/td><td class=\"has-text-align-right\" data-align=\"right\"><strong>\u3046\u3061\u65e5\u672c\u8a9e<\/strong><\/td><td class=\"has-text-align-right\" data-align=\"right\"><strong>\u65e5\u672c\u8a9e\u69cb\u6210\u6bd4\u7387<\/strong><\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\"><strong><span style=\"text-decoration: underline;\">\u25a0\u5168\u4f53<\/span><\/strong><\/td><td class=\"has-text-align-right\" data-align=\"right\"><strong><span style=\"text-decoration: underline;\">1,297,182,716,948<\/span><\/strong><\/td><td class=\"has-text-align-right\" data-align=\"right\"><strong><span style=\"text-decoration: underline;\">889,523,734<\/span><\/strong><\/td><td class=\"has-text-align-right\" data-align=\"right\"><strong><span style=\"text-decoration: underline;\">0.07%<\/span><\/strong><\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">Pile-CC<\/td><td class=\"has-text-align-right\" data-align=\"right\">230,770,471,687<\/td><td class=\"has-text-align-right\" data-align=\"right\">14,799,633<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.01%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">PubMed Central<\/td><td class=\"has-text-align-right\" data-align=\"right\">181,751,973,640<\/td><td class=\"has-text-align-right\" data-align=\"right\">18,441,201<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.01%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">Books3<\/td><td class=\"has-text-align-right\" data-align=\"right\">153,023,754,599<\/td><td class=\"has-text-align-right\" data-align=\"right\">30,846,052<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.02%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">OpenWebText2<\/td><td class=\"has-text-align-right\" data-align=\"right\">124,271,071,981<\/td><td class=\"has-text-align-right\" data-align=\"right\">390,955,625<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.31%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">ArXiv<\/td><td class=\"has-text-align-right\" data-align=\"right\">113,558,856,961<\/td><td class=\"has-text-align-right\" data-align=\"right\">307,774<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">Github<\/td><td class=\"has-text-align-right\" data-align=\"right\">96,487,741,975<\/td><td class=\"has-text-align-right\" data-align=\"right\">314,200,423<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.33%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">FreeLaw<\/td><td class=\"has-text-align-right\" data-align=\"right\">79,956,003,366<\/td><td class=\"has-text-align-right\" data-align=\"right\">2,832<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">StackExchange<\/td><td class=\"has-text-align-right\" data-align=\"right\">64,805,917,113<\/td><td class=\"has-text-align-right\" data-align=\"right\">19,770,098<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.03%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">Wikipedia (en)<\/td><td class=\"has-text-align-right\" data-align=\"right\">50,611,222,086<\/td><td class=\"has-text-align-right\" data-align=\"right\">5,979,976<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.01%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">USPTO Backgrounds<\/td><td class=\"has-text-align-right\" data-align=\"right\">47,025,652,534<\/td><td class=\"has-text-align-right\" data-align=\"right\">0<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">PubMed Abstracts<\/td><td class=\"has-text-align-right\" data-align=\"right\">39,086,727,467<\/td><td class=\"has-text-align-right\" data-align=\"right\">730<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">Gutenberg (PG-19)<\/td><td class=\"has-text-align-right\" data-align=\"right\">26,723,093,699<\/td><td class=\"has-text-align-right\" data-align=\"right\">17,845<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">OpenSubtitles<\/td><td class=\"has-text-align-right\" data-align=\"right\">19,750,201,478<\/td><td class=\"has-text-align-right\" data-align=\"right\">35,070<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">DM Mathematics<\/td><td class=\"has-text-align-right\" data-align=\"right\">15,719,095,096<\/td><td class=\"has-text-align-right\" data-align=\"right\">0<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">Ubuntu IRC<\/td><td class=\"has-text-align-right\" data-align=\"right\">11,184,940,471<\/td><td class=\"has-text-align-right\" data-align=\"right\">0<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">BookCorpus2<\/td><td class=\"has-text-align-right\" data-align=\"right\">9,378,252,620<\/td><td class=\"has-text-align-right\" data-align=\"right\">833,334<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.01%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">EuroParl<\/td><td class=\"has-text-align-right\" data-align=\"right\">8,516,900,087<\/td><td class=\"has-text-align-right\" data-align=\"right\">20<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">HackerNews<\/td><td class=\"has-text-align-right\" data-align=\"right\">7,872,312,505<\/td><td class=\"has-text-align-right\" data-align=\"right\">77,793<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">YoutubeSubtitles<\/td><td class=\"has-text-align-right\" data-align=\"right\">6,431,579,291<\/td><td class=\"has-text-align-right\" data-align=\"right\">93,060,437<\/td><td class=\"has-text-align-right\" data-align=\"right\">1.45%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">PhilPapers<\/td><td class=\"has-text-align-right\" data-align=\"right\">4,745,004,923<\/td><td class=\"has-text-align-right\" data-align=\"right\">194,891<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">NIH ExPorter<\/td><td class=\"has-text-align-right\" data-align=\"right\">3,814,590,930<\/td><td class=\"has-text-align-right\" data-align=\"right\">0<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><tr><td class=\"has-text-align-left\" data-align=\"left\">Enron Emails<\/td><td class=\"has-text-align-right\" data-align=\"right\">1,697,352,439<\/td><td class=\"has-text-align-right\" data-align=\"right\">0<\/td><td class=\"has-text-align-right\" data-align=\"right\">0.00%<\/td><\/tr><\/tbody><\/table><figcaption class=\"wp-element-caption\">The Pile\u306e\u65e5\u672c\u8a9e\u69cb\u6210\u6bd4\u7387<\/figcaption><\/figure>\n<\/div><\/div>\n\n\n\n<p>\u306a\u304a\u3001\u65e5\u672c\u8a9e\u3068\u3057\u3066\u542b\u307e\u308c\u308b\u6587\u5b57\u306e\u30c8\u30c3\u30d75\u306f\u6b21\u306e\u901a\u308a\u3060\u3063\u305f[5]\u3002\u3072\u3089\u304c\u306a\u306e\u51fa\u73fe\u983b\u5ea6\u304c\u901a\u5e38\u3068\u540c\u3058\u306a\u306e\u304b\u306f\u7591\u554f\u3067\u306f\u3042\u308b\u3002<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>\u300c\u306e\u300d\u7d0420M\u6587\u5b57<\/li>\n\n\n\n<li>\u300c\u3044\u300d\u7d0414M\u6587\u5b57<\/li>\n\n\n\n<li>\u300c\u7684\u300d\u7d0413M\u6587\u5b57<\/li>\n\n\n\n<li>\u300c\u306b\u300d\u7d0413M\u6587\u5b57<\/li>\n\n\n\n<li>\u300c\u308b\u300d\u7d0412M\u6587\u5b57<\/li>\n<\/ol>\n\n\n\n<h2 class=\"wp-block-heading\">\u7d50\u679c\u3068\u307e\u3068\u3081<\/h2>\n\n\n\n<p>\u30aa\u30fc\u30d7\u30f3\u306aLLM\u958b\u767a\u3067\u3088\u304f\u4f7f\u308f\u308c\u308b\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u3001The Pile\u306e\u69cb\u6210\u6587\u5b57\u304b\u3089\u65e5\u672c\u8a9e\u306e\u6bd4\u7387\u3092\u8003\u3048\u3066\u307f\u305f\u3002\u7d50\u679c\u3001\u591a\u3081\u306b\u898b\u7a4d\u3082\u3063\u3066900M\u6587\u5b57\u3001\u69cb\u6210\u6bd4\u73870.07%\u3068\u975e\u5e38\u306b\u5c11\u306a\u3044\u3002\u3053\u306e\u7a0b\u5ea6\u3057\u304b\u5165\u3063\u3066\u3044\u306a\u3044\u306b\u3082\u304b\u304b\u308f\u3089\u305a\u65e5\u672c\u8a9e\u6587\u3068\u3057\u3066\u4e00\u5b9a\u7a0b\u5ea6\u6210\u7acb\u3057\u305f\u6587\u304c\u8fd4\u3063\u3066\u304f\u308bCerebras-GPT-6.7B\u306f\u51c4\u3044\u3002<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">\u811a\u6ce8<\/h4>\n\n\n\n<p>[1] GPU\u3067\u52d5\u4f5c\u3055\u305b\u308b\u90e8\u5206\u3060\u3051\u82e5\u5e72\u30b3\u30fc\u30c9\u3092\u8ffd\u52a0\u3057\u3066\u3044\u308b\u3002<a href=\"https:\/\/huggingface.co\/cerebras\/Cerebras-GPT-13B\">cerebras\/Cerebras-GPT-13B \u00b7 Hugging Face<\/a>\u306f\u66f8\u304b\u308c\u305f\u624b\u9806\u3067\u306f\u30e1\u30e2\u30ea\u4e0d\u8db3\u306b\u306a\u308b\u3002float16\u306b\u3059\u308c\u3070\u52d5\u4f5c\u3059\u308b\u3002<br>[2] \u3053\u306e\u5f8c\u6539\u884c\u3057\u3066\u300c\u79c1\u306f\u3001\u305d\u308c\u305e\u308c\u306e\u8a00\u8449\u306b\u3064\u3044\u3061\u3083\u3063\u305f\u3002\u300c\u304a\u7236\u3055\u3093\u3001\u3053\u3093\u306a\u306b\u7f8e\u3057\u3044\u72ac\u304c\u3044\u307e\u3059\u304b?\u300d\u300c\u3053\u306e\u4eba\u3001\u3042\u306a\u305f\u306e\u307b\u3046\u304c\u7d20\u6575\u3067\u3059\u3088\u300d\u300d\u30fb\u30fb\u30fb\u3068\u8b0e\u306e\u6587\u5b57\u5217\u304c\u7d9a\u304f\u3002<br>[3] Gao, L., Biderman, S., Black, S., Golding, L., Hoppe, T., Foster, C., Phang, J., He, H., Thite, A., Nabeshima, N., Presser, S., and Leahy, C. 2020. The Pile: An 800GB Dataset of Diverse Text for Language Modeling.&nbsp;<em>arXiv preprint arXiv:2101.00027<\/em>.<br>[4]\u3068\u3066\u3082\u96d1\u306a\u3053\u3068\u306f\u627f\u77e5\u3057\u3064\u3064\u3001\u65e5\u672c\u8a9e\u5224\u5b9a\u306f\u7d50\u69cb\u96e3\u3057\u304f\u5224\u5b9a\u306b\u51dd\u308b\u3068\u51e6\u7406\u6642\u9593\u306e\u554f\u984c\u304c\u51fa\u308b\u305f\u3081\u4eca\u56de\u306f\u3053\u306e\u6761\u4ef6\u3067\u691c\u8a3c\u3057\u305f\u3002\u65e5\u672c\u8a9e\u5206\u3092\u591a\u3081\u306b\u30ab\u30a6\u30f3\u30c8\u3057\u3066\u3044\u3066\u51fa\u305f\u7d50\u679c\u3067\u7d50\u8ad6\u306b\u306f\u5f71\u97ff\u3057\u306a\u3044\u3002<br>[5] \u300c\u7684\u300d\u304c\u5165\u3063\u3066\u3044\u308b\u306e\u306f\u4e2d\u56fd\u8a9e\u3082\u5165\u3063\u3066\u3044\u308b\u304b\u3089\u304b\u306a\uff1f\u3068\u601d\u3046\u3002<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">\u305d\u306e\u4ed6<\/h4>\n\n\n\n<p>The Pile\u3092\u4f7f\u3063\u3066\u69cb\u7bc9\u3057\u305f\u5927\u898f\u6a21\u8a00\u8a9e\u30e2\u30c7\u30eb\u3067\u4f55\u6545\u304b\u65e5\u672c\u8a9e\u304c\u4f7f\u3048\u308b\u306e\u306f\u77e5\u3089\u308c\u3066\u3044\u305f\u3002\u305d\u306e\u7406\u7531\u3068\u3057\u3066Github\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u306e\u65e5\u672c\u8a9e\u30b3\u30e1\u30f3\u30c8\u306b\u3088\u308b\u3082\u306e\u306a\u3069\u306e\u7406\u7531\u3092\u805e\u3044\u305f\u3053\u3068\u3082\u3042\u3063\u305f\u304c\u3001\u672c\u5f53\u304b\u78ba\u8a3c\u304c\u306a\u304b\u3063\u305f\u306e\u3067\u8abf\u3079\u3066\u307f\u305f\u3002\u7d50\u679c\u5f53\u305f\u3089\u305a\u3082\u9060\u304b\u3089\u305a\u3068\u3044\u3046\u611f\u3058\u3060\u3063\u305f\uff08\u69cb\u6210\u8981\u7d20\u3068\u3057\u3066\u306f\u305d\u308c\u306a\u308a\u306b\u591a\u3044\u304c\u534a\u5206\u306f\u8d85\u3048\u3066\u3044\u306a\u3044\uff09\u3002<\/p>\n\n\n\n<p>\u500b\u4eba\u7684\u306b\u306f1G\u6587\u5b57\u3044\u304b\u306a\u3044\u30ec\u30d9\u30eb\u306e\u30c7\u30fc\u30bf\u3067\u65e5\u672c\u8a9e\u80fd\u529b\u3092\u5b66\u7fd2\u3057\u3066\u3044\u308b\u3053\u3068\u306b\u9a5a\u304d\u3060\u3063\u305f\u3002\u7591\u554f\u306b\u601d\u3063\u305f\u3053\u3068\u306f\u8abf\u3079\u3066\u307f\u308b\u3068\u9762\u767d\u3044\u3002The Pile\u306bwikipedia-ja\u306e\u30c7\u30fc\u30bf\u3092\u6df7\u305c\u308b\u3060\u3051\u3067\u3082\u65e5\u672c\u8a9e\u80fd\u529b\u306f\u304b\u306a\u308a\u4e0a\u304c\u308b\u3093\u3058\u3083\u306a\u3044\u3060\u308d\u3046\u304b\u3002<a href=\"https:\/\/huggingface.co\/BlinkDL\/rwkv-4-raven\">BlinkDL\/rwkv-4-raven \u00b7 Hugging Face<\/a>\u306e\u3088\u3046\u306b1%\u7a0b\u5ea6\u3067\u3082\u65e5\u672c\u8a9e\u304c\u5165\u3063\u3066\u3044\u308b\u3068\u7d50\u69cb\u306a\u52b9\u679c\u304c\u3042\u308a\u305d\u3046\u306b\u601d\u3046\u3002<\/p>\n","protected":false},"excerpt":{"rendered":"<p>ChatGPT\u304c\u76db\u308a\u4e0a\u304c\u308b\u4e2d\u3001\u30aa\u30fc\u30d7\u30f3\u30e9\u30a4\u30bb\u30f3\u30b9\u306aLLM\uff08\u5927\u898f\u6a21\u8a00\u8a9e\u30e2\u30c7\u30eb\uff09\u958b\u767a\u3082\u884c\u308f\u308c\u3066\u3044\u308b\u3002\u305d\u306e\u4e2d\u3067Cerebras-GPT\uff08Cerebras Systems Releases Seven New GPT Model &hellip; <a href=\"https:\/\/staka.jp\/wordpress\/?p=854\" class=\"more-link\"><span class=\"screen-reader-text\">&#8220;The Pile\u306e\u69cb\u6210\uff08\u306a\u305cCerebras-GPT\u3067\u65e5\u672c\u8a9e\u304c\u4f7f\u3048\u308b\u306e\u304b\uff1f\uff09&#8221; \u306e<\/span>\u7d9a\u304d\u3092\u8aad\u3080<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2,11],"tags":[24,56],"class_list":["post-854","post","type-post","status-publish","format-standard","hentry","category-ai","category-11","tag-cerebras-gpt","tag-pile"],"_links":{"self":[{"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/854","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=854"}],"version-history":[{"count":0,"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/854\/revisions"}],"wp:attachment":[{"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=854"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=854"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/staka.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=854"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}