1. Text Mining Fundamentals
In a society overflowing with information, finding, understanding, and putting valuable information to use has become a major challenge. One of the most effective ways to address it is text mining.
1.1 What Is Text Mining?
Text mining means extracting useful information from unstructured text data and identifying the patterns and trends in it. It draws on techniques such as natural language processing (NLP) and machine learning. Put another way, text mining is the process of finding valuable information in large volumes of text and interpreting that information.
1.2 Practical Applications of Text Mining
Text mining is used in a wide range of fields. For example, it can be used to analyze sentiment in user comments on social media and review sites in order to gauge how products and services are perceived. It is also used to extract topics from large collections of news articles and reports and to spot important trends.
As these examples show, text mining is a powerful tool for gaining deeper insight from the text data already in front of us.
1.3 The General Text Mining Process
The text mining process consists of the following steps. First, in the data collection phase, the text data is gathered. Next, in the preprocessing phase, the text data is converted into a form that can be analyzed. This step includes cleaning and normalizing the text.
Then, in the analysis phase, useful information is extracted from the processed data. This step includes, for example, analyzing word frequencies and extracting topics. Finally, in the interpretation and visualization phase, the extracted information is interpreted and visualized using graphs and charts.
Through these steps, text mining makes it possible to extract useful information from large volumes of text data, visualize it, and understand it.
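The four phases above can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: the sample comments, the function names (`collect`, `preprocess`, `analyze`, `report`), and the choice of word counting as the analysis are all assumptions made for the example.

```python
from collections import Counter

def collect():
    # Collection phase: hypothetical hard-coded comments stand in for real data.
    return ["  Great product, GREAT price! ", "Slow delivery...", "great support"]

def preprocess(text):
    # Preprocessing phase: lowercase, drop characters that are neither
    # alphanumeric nor whitespace, then split on whitespace.
    cleaned = "".join(ch for ch in text.lower() if ch.isalnum() or ch.isspace())
    return cleaned.split()

def analyze(documents):
    # Analysis phase: count how often each word appears across all documents.
    counts = Counter()
    for doc in documents:
        counts.update(preprocess(doc))
    return counts

def report(counts, top_n=3):
    # Interpretation phase: list the most frequent words as readable lines.
    return [f"{word}: {n}" for word, n in counts.most_common(top_n)]

word_counts = analyze(collect())
for line in report(word_counts):
    print(line)
```

Each phase is a separate function so that any one of them can be replaced (for example, swapping the hard-coded list for a file reader) without touching the others.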
2. Preparing the Data
Before starting text mining, the data has to be prepared. This section explains how to obtain text data and how to organize it in Excel.
2.1 How to Obtain Text Data
To do text mining, you first need to gather text data. There are two approaches: collecting the data yourself, or using a dataset that has already been published. When collecting it yourself, the usual method is web scraping, which retrieves information from websites. Web scraping requires some knowledge and technical skill, but it lets you obtain data tailored to your own research question. When using a public dataset, you download text data from an open data repository or a data sharing platform. Such datasets have usually been curated already, and they take less effort to obtain.
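As a rough sketch of one part of web scraping, the following Python code pulls the visible text out of an HTML page using only the standard library. The download step is left out, and the HTML string and the `TextExtractor` class name are made up for the example; real pages are usually messier and often call for a dedicated parsing library.

```python
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep non-empty text that is not inside a skipped element.
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())

# Hypothetical page content; in practice this would come from a downloaded page.
html = (
    "<html><head><style>p{color:red}</style></head>"
    "<body><h1>Review</h1><p>Battery life is <b>great</b>.</p></body></html>"
)

parser = TextExtractor()
parser.feed(html)
parser.close()
text = " ".join(parser.parts)
print(text)
```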
2.2 How to Organize Data in Excel
Once you have the data, the next step is to organize it. Excel is a very convenient tool for this. With the "Text to Columns" feature on the "Data" menu, you can split text held in a single cell across multiple cells. This makes it possible, for example, to break a whole sentence down and analyze it at the word level.
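For comparison, the same kind of splitting can be sketched in Python. The sample row and its three fields are invented for the example. Splitting on whitespace assumes a language that separates words with spaces; text without such separators needs a tokenizer instead.

```python
import csv
import io

# Hypothetical exported row: date, user name, and a free-text comment.
row = "2023-05-01,Alice,The battery life is excellent"

# Split on a delimiter, comparable to Text to Columns with a comma delimiter.
date, user, comment = next(csv.reader(io.StringIO(row)))

# Split the comment on whitespace so that each word becomes its own "cell".
words = comment.split()
print(words)
```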
The "Filter" feature lets you display only the rows that match a given condition, so you can pull out just the data you need right away.
In addition, conditional formatting lets you change the color of cells based on specific conditions. This makes the characteristics of the data visible at a glance, which helps when organizing it.
Make full use of these Excel features to organize your text data efficiently.
This section covered how to obtain text data and how to organize it in Excel, which together form the groundwork for the data preparation phase of text mining.
3. Text Mining Techniques in Excel
Once the data is ready, the next step is analysis in Excel. This section covers analysis with basic Excel functions, preprocessing of text data, and text cleaning and normalization.
3.1 Basic Analysis with Excel Functions
Excel provides many useful functions, and with them you can carry out elementary text mining analysis. For example, the COUNTIF function counts the number of cells that match a given condition. If the text has been split so that each cell holds one word, this tells you how many times a particular word appears in the text.
The LEN function returns the number of characters in a cell, which makes analysis based on text length possible.
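To make the behavior of these two functions concrete, here is a small Python sketch of what they compute. The cell values and the helper name `countif_equals` are assumptions for the example. The comparison ignores case because COUNTIF itself is not case-sensitive.

```python
# Hypothetical column of cells, one word or phrase per cell.
cells = ["good", "bad", "good", "very good", "Good"]

def countif_equals(values, target):
    # Comparable to =COUNTIF(range, "good"): counts cells whose whole
    # content equals the target, ignoring case.
    return sum(1 for value in values if value.lower() == target.lower())

# Comparable to applying =LEN(cell) to every cell in the column.
lengths = [len(value) for value in cells]

print(countif_equals(cells, "good"))
print(lengths)
```

Note that "very good" is not counted, because the whole cell has to match the condition.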
3.2 Preprocessing Text Data
One of the important steps carried out before text mining is preprocessing the text data. In preprocessing, you use the TRIM function to remove unnecessary spaces and the LOWER function to convert all text to lowercase. This removes elements that could act as noise during analysis and improves accuracy.
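The effect of combining TRIM and LOWER can be sketched in Python as follows. The function names `trim` and `preprocess` are chosen for the example, and `trim` is only an approximation of Excel's TRIM: it handles ordinary space characters, not tabs or non-breaking spaces.

```python
def trim(text):
    # Approximates Excel's TRIM: remove leading and trailing spaces and
    # collapse runs of spaces between words into a single space.
    return " ".join(part for part in text.split(" ") if part)

def preprocess(text):
    # TRIM followed by LOWER, as in =LOWER(TRIM(A1)).
    return trim(text).lower()

print(preprocess("  Hello   World  "))
```

After this step, "  Hello   World  " and "hello world" are treated as the same text, which is what keeps stray spaces and capitalization from splitting one word into several categories.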
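3.3 Text Cleaning and Normalization
Cleaning and normalizing the text is another important part of preprocessing. In cleaning, you use the SUBSTITUTE function to replace specific characters with other characters, and in this way remove unwanted symbols and special characters from the text.
Normalization, on the other hand, converts the information in the text into a consistent form. Examples include converting all uppercase letters to lowercase and replacing numbers with a fixed symbol. As a result, expressions that mean the same thing but are written in different forms can be treated as identical.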
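The following Python sketch shows one way cleaning and normalization might be combined. It uses regular expressions rather than SUBSTITUTE, and the placeholder `NUM` and the function name `normalize` are assumptions for the example, not part of any standard.

```python
import re

def normalize(text, number_token="NUM"):
    # Normalization: lowercase everything.
    text = text.lower()
    # Normalization: replace each number (including forms like 1,200 or 3.5)
    # with a fixed placeholder.
    text = re.sub(r"\d+(?:[.,]\d+)*", number_token, text)
    # Cleaning: replace symbols and punctuation with spaces.
    text = re.sub(r"[^\w\s]", " ", text)
    # Collapse the leftover runs of whitespace.
    return " ".join(text.split())

print(normalize("Price: $1,200!!"))
print(normalize("PRICE - 980"))
```

Both inputs come out as the same string, so differently formatted price mentions are counted together. Numbers are replaced before symbols are removed so that "1,200" stays a single number instead of being split at the comma.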
4. Text Analysis
Once preprocessing in Excel is finished, the next step is text analysis. This section covers three techniques: word frequency analysis, content analysis, and sentiment analysis.
4.1 Word Frequency Analysis
Word frequency analysis examines how many times a particular word appears in a text. In Excel, once each word sits in its own cell, the COUNTIF function makes this analysis easy to carry out.
Analyzing word frequencies helps you understand which topics a text discusses and what its main themes are.
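Outside Excel, the same analysis is commonly done with a counter structure. The sketch below uses Python's `collections.Counter` on an invented sentence that is assumed to be already preprocessed.

```python
from collections import Counter

# Hypothetical text that has already been lowercased and cleaned.
text = "battery life is great and the battery charges fast"

# Count every word in one pass.
counts = Counter(text.split())

# The most frequent word hints at the main topic of the text.
top_word, top_count = counts.most_common(1)[0]
print(top_word, top_count)
```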
4.2 Content Analysis
Content analysis examines how frequently specific keywords and phrases appear in a text. With the COUNTIF function and a wildcard condition such as "*keyword*", you can easily count how many cells contain a given keyword or phrase.
Content analysis helps you understand what a text is about and how that content comes across to readers.
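The difference between counting texts that contain a phrase and counting every occurrence of that phrase is easy to overlook, so here is a short Python sketch of both. The reviews and the phrase are invented for the example.

```python
# Hypothetical reviews, one per "cell".
reviews = [
    "Customer service was helpful. Customer service replied fast.",
    "Delivery was late.",
    "Great customer service!",
]
phrase = "customer service"

# Number of reviews that mention the phrase at least once
# (what a wildcard COUNTIF over the column reports).
docs_with_phrase = sum(1 for review in reviews if phrase in review.lower())

# Total number of occurrences across all reviews (non-overlapping matches).
total_occurrences = sum(review.lower().count(phrase) for review in reviews)

print(docs_with_phrase, total_occurrences)
```

The two numbers differ whenever a phrase is repeated within one text, so it is worth deciding up front which of them the analysis needs.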
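4.3 Sentiment Analysis
Sentiment analysis determines the emotions and opinions expressed in a text. It works by classifying the tone of the text as, for example, "positive", "negative", or "neutral".
To do sentiment analysis in Excel, you first need to prepare a sentiment dictionary: a list that defines whether each word carries a positive or a negative meaning. Using this dictionary, you check which category each word in the text falls into and then assess the overall sentiment.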
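The dictionary-based approach described above can be sketched in Python as follows. The two word lists are tiny made-up examples, and the scoring rule (positive matches minus negative matches) is one simple choice among many; it does not handle negation such as "not good".

```python
# Hypothetical sentiment dictionary, split into two word sets.
POSITIVE = {"good", "great", "excellent", "fast"}
NEGATIVE = {"bad", "slow", "poor", "broken"}

def sentiment(text):
    # Lowercase, split into words, and strip common trailing punctuation.
    words = [word.strip(".,!?") for word in text.lower().split()]
    # Score = number of positive words minus number of negative words.
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(sentiment("Great screen, but slow and poor battery."))
```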
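5. Visualizing the Results
Once the text analysis is complete, the next step is to visualize the results. Excel's rich charting features let you turn data into a form that can be understood at a glance. This section explains how to create charts in Excel and how to interpret the data and draw conclusions.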
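5.1 Creating Charts in Excel
Excel can create many kinds of charts. It is important to choose a chart that suits the purpose, such as a histogram or bar chart to show word frequencies, or a network diagram to show how keywords relate to one another (network diagrams are not a built-in Excel chart type and generally require an add-in or another tool).
Charts are created from the "Charts" group on the "Insert" tab. Select the data, then choose a suitable chart type. Excel automatically builds the chart from the selected data.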
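As a stand-in for an Excel bar chart, the sketch below renders word frequencies as a plain-text bar chart in Python. The counts and the function name `text_bar_chart` are invented for the example, and a real analysis would more likely use a plotting library or Excel itself.

```python
def text_bar_chart(counts, width=20):
    # Return one line per word, longest bar first.
    if not counts:
        return []
    largest = max(counts.values())
    lines = []
    for word, n in sorted(counts.items(), key=lambda item: item[1], reverse=True):
        # Scale each bar so the largest count fills `width` characters.
        bar = "#" * max(1, round(n / largest * width))
        lines.append(f"{word:<10} {bar} {n}")
    return lines

# Hypothetical word frequencies from an earlier analysis step.
chart = text_bar_chart({"battery": 10, "screen": 5, "price": 1})
for line in chart:
    print(line)
```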
5.2 Interpreting the Data and Drawing Conclusions
After creating the charts, the next step is to interpret the data and draw conclusions. The aim of this step is to extract information from the data and understand what it means.
For example, if a word appears frequently, the text is probably mainly about a topic related to that word. Likewise, if certain keywords frequently appear together, there may be some relationship between them. Insights like these are useful for business decision-making and strategy planning.
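Checking which keywords appear together can be sketched in Python by counting word pairs per text. The three short texts are invented for the example, and "appearing together" is taken here to mean appearing in the same text, which is only one possible definition.

```python
from collections import Counter
from itertools import combinations

# Hypothetical preprocessed texts.
docs = [
    "battery life short",
    "battery charger broken",
    "short battery life",
]

pair_counts = Counter()
for doc in docs:
    # Use each word once per text, sorted so that a pair always has the same order.
    unique_words = sorted(set(doc.split()))
    pair_counts.update(combinations(unique_words, 2))

print(pair_counts[("battery", "life")])
```

Pairs with high counts are candidates for a relationship worth a closer look; the count alone does not say what the relationship is.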
Interpreting data is not straightforward, however. The same data can lead different people to different conclusions. It is therefore important to interpret it as objectively as possible and in a way that keeps bias out.
This section covered the basics of visualizing and interpreting text analysis results, so that the results can be put to good use and lead to useful insights.
7. Summary and Next Steps
This article has introduced the basic concepts and techniques of text mining with Excel. Text mining in Excel has both strengths and limitations, which are discussed below, along with resources for further study.
7.1 Limitations and Strengths of Text Mining in Excel
Excel is an accessible tool for beginners in data analysis and is well suited to learning basic text mining techniques. It has limits, however, when it comes to large volumes of data and complex analysis. More advanced analysis will likely require a programming language such as Python or R.
Even so, Excel's strengths are its intuitive interface and its broad user base. They allow people who do not program to grasp the basics of text mining.
7.2 Resources for Learning More About Text Mining
Online courses and books are useful for deepening your knowledge of text mining. For example, Coursera and edX offer courses on text mining. Books such as "Natural Language Processing with Python" and "Text Mining with R" are also recommended as learning resources.
Working on real projects is important as well. You can gain practical experience by downloading a dataset and analyzing it yourself, or by taking part in data science competitions such as those on Kaggle.