CARVIEW |
?mpandas??????n?T???I?f?[?^???͂̊?b?F?^?C?^?j?b?N?f?[?^?Z?b?g?ׂĂ݂悤?FPython?f?[?^????????
?@?B?w?K??f?B?[?v???[?j???O?ɂ????ė??K?ޗ??Ƃ??Ă悭?g????^?C?^?j?b?N?f?[?^?Z?b?g???g???āA???̊T?v??A???ʂƐ????Ɋ֘A?????邩?ǂ????ׂĂ݂܂??傤?B
?{?V???[?Y?Ɩ{?A?ڂɂ???
?@?{?V???[?Y?uPython?f?[?^????????v?́APython?̊?b???}?X?^?[?????l??ΏۂɈȉ??̂悤?ȁAPython???g???ăf?[?^?????????悤?Ƃ????Ƃ??ɕ֗??Ɏg????c?[????C?u?????A?t???[?????[?N?̎g?????̊?b???????????̂ł??B
- NumPy?i?uNumPy??????v?̖ڎ??????????j
- pandas?i?{?A?ځj
- Matplotlib
?@?Ȃ??A?{?A?ڂł͈ȉ??̃o?[?W???????g?p???Ă??܂??B
- Python 3.12
- pandas 2.2.1
?@?O???͌????l?Ƃ??̏????̎d??????????܂????B????̓^?C?^?j?b?N?f?[?^?Z?b?g???g???āA???̓??e???ȒP?Ɋm?F???Ă݂܂??B
?^?C?^?j?b?N?f?[?^?Z?b?g
?@?^?C?^?j?b?N?f?[?^?Z?b?g?̏ڍׂɂ??ẮuTitanic?F?^?C?^?j?b?N????q?҂̐????i?N???ʂȂǂ?13???ځj?̕\?`???f?[?^?Z?b?g?v???????????????B???????A?O?f?̃????N?ɂ??????f?[?^?Z?b?g?z?z?????????N??̂悤?Ȃ̂ŁA?????ł?GitHub??pandas???|?W?g??????擾?ł???titanic.csv?t?@?C?????g?????Ƃɂ??܂????B???̃t?@?C???͑O?f?̃????N?ʼn?????Ă???^?C?^?j?b?N?f?[?^?Z?b?g?ƈقȂ?_??????̂ŁA?ȒP?ɈقȂ?ӏ????܂Ƃ߂Ă????܂??傤?B
- ?s????891?s?ɂȂ??Ă???
- 'PassengerId'?s???lj?????Ă???
- ????̗폜????Ă???
?@??????̃^?C?^?j?b?N?f?[?^?Z?b?g??????͈ȉ??̂悤?ɂȂ??Ă??܂??B
- 'PassengerId'?F??qID?B?????l
- 'Survived'?F?????c???????ǂ????B?????l?i0?F???S?A1?F?????j
- 'Pclass'?F???q?N???X?B?????l?i1?F1???A2?F2???A3?F3???j
- 'Name'?F??q???B??????
- 'Sex'?F???ʁB??????imale?F?j???Afemale?F?????j
- 'Age'?F?N??B?????????_???l
- 'SibSp'?F???悵?Ă????Z???z??҂̐??B?????l
- 'Parch'?F???悵?Ă????e??q?̐??B?????l
- 'Ticket'?F?`?P?b?g?ԍ??B??????
- 'Fare'?F?^???B?????????_???l
- 'Cabin'?F?q???ԍ??B??????
- 'Embarked'?F??D?n?B??????iS?F?T?E?T???v?g???AC?F?V?F???u?[???AQ?F?N?C?[???Y?^?E???j
?@?????l?╂???????_???l?????ł͂Ȃ??A??????̃f?[?^???܂?ł??܂??B?????̃f?[?^???ǂ̂悤?ɏ???????悢?????l????K?v??????ł??傤?B
?@?????̗?̒???'Survived'??͑??̗?Ƃ͈قȂ鈵????????邱?Ƃ??悭????܂??B?^?C?^?j?b?N?f?[?^?Z?b?g?ł́A???̗?̃f?[?^?𑍍??I?Ɋ??Ă??????ʁA??q???g?????????c?????̂??ǂ??????l???邱?Ƃ??悭????܂??B?܂?A'Survived'??͑??̗???????Ƃ??Ĕ??????????ʂł???A???̗??'Survived'??̌??ʂݏo?????v???ł??B?O?҂?ړI?ϐ???????x???ƁA??҂?????ϐ???????ʂƂ??????Ƃ?????܂??B
?@?܂??A???͂Ƃ?????A??L?̃????N????titanic.csv?t?@?C?????_?E?????[?h???āA?ǂݍ???ł݂邱?Ƃɂ??܂??B
?@CSV?t?@?C???̓ǂݍ??݂ɂ?pandas.read_csv?????g???܂??B
import pandas as pd
df = pd.read_csv('titanic.csv')
df
?@Visual Studio Code?i?ȉ??AVS Code?j?ŐV?K?Ƀm?[?g?u?b?N???쐬???A??L?R?[?h?????s????ƁA???̂悤??titanic.csv?t?@?C???̓??e??DataFrame?I?u?W?F?N?g?Ƃ??ēǂݍ??܂?A???̓??e?i?̈ꕔ?j???\??????܂??B
?@????Ŏ?肠?????̏????͊????ł??B???̌?͎??̂悤?Ȃ??Ƃ?????Ă????܂??傤?B
- DataFrame?I?u?W?F?N?g?̊T?v?̒???
- ?^?C?^?j?b?N?f?[?^?Z?b?g?????????????ׂĂ݂?
DataFrame?I?u?W?F?N?g?̊T?v?ׂ?
?@DataFrame?I?u?W?F?N?g?̊T?v?ׂ?ɂ́A?ȑO?ɂ??Љ???ȉ??̃??\?b?h?Ȃǂ??g???܂??B
- pandas.DataFrame.head???\?b?h
- pandas.DataFrame.info???\?b?h
- pandas.DataFrame.tail???\?b?h
?@??Ō????R?[?h?̍ŏI?s?udf?v?????s???邾???ł??A????DataFrame?I?u?W?F?N?g?̐擪5?s?Ɩ???5?s?A???ꂩ?炱??DataFrame?I?u?W?F?N?g?ɂ?891?s?~12??̃f?[?^???i?[????Ă??邱?Ƃ???????܂??B?擪5?s?Ɩ???5?s???\??????Ă???̂ŁAhead???\?b?h??tail???\?b?h???????K?v?͂Ȃ??ł??傤????A?????ł?info???\?b?h???Ăяo???Ă݂܂??B
pandas.DataFrame.info???\?b?h
pandas.DataFrame.info(verbose=None, max_cols=None, show_counts=None)
?@DataFrame?I?u?W?F?N?g?̊T?v???o?͂???B????̃p?????[?^?[???ȉ??Ɏ????B?S?p?????[?^?[?ɂ??Ă?pandas?̃h?L???????g?upandas.DataFrame.info?v???Q?Ƃ̂??ƁB
- verbose?F?T?v??S?ĕ\?????邩?ǂ????̎w??BTrue???w?肷??ƊT?v??S?ĕ\?????AFalse???w?肷??ƍs???ƗA?f?[?^?^?̗v??Ȃǂ?????\??????B?ȗ????ɂ?pandas.options.display.max_info_columns?????̒l?????????Ȃ????True???w?肵???̂Ɠ????U?镑?????A???????False???w?肵???̂Ɠ????U?镑????????
- max_cols?F?T?v??S?ĕ\?????邩?A???Ȃ??\?????邩???ւ?????w?肷??B?ȗ????ɂ?pandas.options.display.max_columns?????̒l???w?肳?ꂽ???̂ƌ??Ȃ????
- show_counts?F?e??Ɋ܂܂?錇???l?ł͂Ȃ??l?̐???\?????邩?ǂ??????w?肷??BTrue???w?肷??Ə?ɕ\?????AFalse???w?肷??Ə?ɕ\?????Ȃ??B?ȗ????ɂ?DataFrame?I?u?W?F?N?g?̍s??????ї?pandas.options.display.max_info_rows?????????pandas.options.display.max_info_columns?????̒l?????????????ǂ????ŐU?镑???????肳???
?@?܂??͓??ɉ????w?肵?Ȃ???info???\?b?h???Ăяo???Ă݂܂??B
df.info()
?@VS Code?ł???????s???????ʂ??ȉ??Ɏ????܂??B
?@???̂悤?ɍs?C???f?b?N?X?iRangeIndex?j?Ɨx???icolumns?j?̐??A?e??Ɍ??????Ă??Ȃ??f?[?^???ǂꂾ???܂܂?Ă??邩??A???̗?̃f?[?^?^???????A?e?f?[?^?^?̗ǂꂾ?????邩?Ƃ???????\??????܂??B??????????'Cabin'??ɂ͌??????Ă???f?[?^????ϑ??????Ƃ???????܂??ˁBdtype??????ƁA??????iobject?^?j?ɂȂ??Ă???̂́A'Name'???'Sex'??A'Ticket'??A'Cabin'??A???ꂩ??'Embarked'??ł??B???̂悤?ȕ???????f?[?^?Ƃ??Ċi?[???Ă????ɂ??Ă????A???????v???O?????ŋ@?B?I?ɏ???????ړI?ōŏI?I?ɉ??炩?̐??l?ɕϊ?????K?v???o?Ă???ł??傤?B???̂??Ƃɂ??Ă͎???Ɍ?????????̂Ƃ??܂??B
?@????verbose?p?????[?^?[??False???w?肵?Ă݂܂??傤?B
df.info(verbose=False)
?@?ȉ??????s???ʂł??B
?@???x??max_cols?p?????[?^?[?ɁADataFrame?I?u?W?F?N?g?̗??????Ȃ??l???w?肵?Ă݂܂??B
df.info(max_cols=10)
?@????DataFrame?I?u?W?F?N?g?ɂ?12?̗???̂ŁA??????????Ȃ??l??max_cols?p?????[?^?[?ɓn???ꂽ?Ƃ??????Ƃł??B???̂Ƃ??ɂ́A???̂悤?Ɋe??̏?\??????Ȃ??Ȃ?܂??B
?@?Ō??show_counts?p?????[?^?[??False???w?肵?Ă݂܂??傤?B
df.info(show_counts=False)
?@???̂Ƃ??ɂ͎??̂悤?ɁA???????Ă??Ȃ??f?[?^?̌????\??????Ȃ??Ȃ?܂??i?e??̗x???Ƃ??̗?̃f?[?^?^???????\???????j?B
?@?????̏ꍇ?͓??Ƀp?????[?^?[???w?肹???ɌĂяo???̂??????ł??傤?BDataFrame?I?u?W?F?N?g?̊T?v???Ȍ??ɋ????Ă??炦??͂??ł??B
?@????1?ADataFrame?I?u?W?F?N?g?̊T?v??m?邽?߂̃??\?b?h??????܂??B???ꂪpandas.DataFrame.describe???\?b?h?ł??B
pandas.DataFrame.describe???\?b?h
pandas.DataFrame.describe(percentiles=None, include=None, exclude=None)
?@DataFrame?I?u?W?F?N?g?̊?{???v?ʂ?\??????B?p?????[?^?[???ȉ??Ɏ????B?ڍׂ?pandas?̃h?L???????g?upandas.DataFrame.describe?v???Q?Ƃ̂??ƁB
- percentiles?F?\??????p?[?Z???^?C???̎w??B?ȗ????ɂ?[.25, .5, .75]???w?肳?ꂽ???̂ƌ??Ȃ????
- include?F??{???v?ʂ?\???????̃f?[?^?^?̎w??B'all'???f?[?^?^??v?f?Ƃ??郊?X?g???w??ł???i??q?j?B?ȗ??????ꍇ?͐??l?^?̗?ɂ??Ă̂݊?{???v?ʂ?\??????
- exclude?F??{???v?ʂ̕\?????珜?O?????̃f?[?^?^?̎w??B?f?[?^?^??v?f?Ƃ??郊?X?g???w??ł???i??q?j?B?ȗ??????ꍇ?͓??ɏ??O??????̎w??͂???Ȃ??????ƌ??Ȃ????
?@??قǂƓ??l?A?܂??͂??̂܂?describe???\?b?h???Ăяo???Ă݂܂??B
df.describe()
?@VS Code?ł??̃R?[?h?????s????Ǝ??̂悤?ɂȂ?܂??B
?@?e??̃f?[?^???icount?j?A???ϒl?imean?j?A?W?????istd?j?A?ŏ??l?imin?j?A?l???ʐ??i25%?A50%?A75%?j?A?ő?l?imax?j???\??????邱?Ƃ???????܂????B?l???ʐ??Ƃ????̂́A?f?[?^???????????ɕ??ׂ???őS?̂?4???????A?ŏ??l????4????1?̏ꏊ?ɂ???f?[?^?A2????1?̏ꏊ?ɂ???f?[?^?A4????3?̏ꏊ?ɂ???f?[?^?????o???????̂ł??i??????25???^?C???A50???^?C???A75???^?C???ȂǂƌĂԂ??Ƃ?????܂??j?B????ɂ?????ׂ???10???A20???A?c?c?̂悤?ɕ??????邱?Ƃ?????A???̏ꍇ?̓p?[?Z???^?C???ƌĂԂ??Ƃ?????܂??B?l???ʐ???p?[?Z???^?C???͗?̒??ŌX?̃f?[?^???ǂ̂悤?ɕ??z???Ă??邩??????????ƌ???̂ɖ𗧂??܂??B
?@?Ⴆ?A'PassengerId'???1????891?܂ł̘A?ԂȂ̂ŁA?ŏ??l??1?A25???^?C???̒l??223.5?A50???^?C???̒l??446?A75???^?C???̒l??668.5?A?ő?l??891?Ɠ??Ԋu?ŕ???ł??܂??i????̓f?[?^?ɘA?Ԃŏ??Ԃ?t?????????Ȃ̂ŁA???ۂ̂Ƃ???A????ɂ͈Ӗ??͂???܂???j?B'Survived'??͎??S??0?ŁA??????1??2??ނ̃f?[?^?????i?[???Ă??Ȃ??̂ł?????l???ʐ??????Ă????܂?Ӗ??͂Ȃ??ł??傤?B'Pclass'??????l?ł??B'Age'??ɂ͈Ӗ??????肻???ł??B25???^?C???̒l??20.125?A75???^?C???̒l??38.0?Ƃ??????Ƃ́A??q?Ə?g???S?̂̔?????20????38?ō\??????Ă???Ƃ??????Ƃł??i?ӊO?ɎႢ?H?j?B????Ȋ????Ńf?[?^??ǂ݉????Ă????̂?describe???\?b?h?͖𗧂??Ƃ?????܂??B
?@?Ȃ??Apercentiles?p?????[?^?[?ɂ͎??̂悤?ɂǂ̈ʒu?̃f?[?^???擾?????????????X?g?Ɋ܂߂Ďw?肵?܂??B?ȉ??̓f?t?H???g?̎w??Ɠ????ł??B
df.describe(percentiles=[.25, .75, .95])
?@include?p?????[?^?[?͊?{???v?ʂ?\????????????w?肷??̂Ɏg???܂??B?f?t?H???g?ł͐??l??v?f?Ƃ???????ΏۂƂȂ?܂??B???̃p?????[?^?[?̎w????@?͊??????܂????A?ȒP?Ȃ̂?'integer'?i??????v?f?Ƃ????j??'float'?i?????????_????v?f?Ƃ????j?A'number'?i???l??v?f?Ƃ????j?A'object'?i??????Ȃǂ?v?f?Ƃ????j?Ȃǂ????X?g?ɂ܂Ƃ߂ēn?????̂ł??B?ȉ??ɗ???????܂??B
df.describe(include=['object'])
?@include?p?????[?^?[?ɂ?['object']?Ǝw?肵?Ă???̂ŁA?????ł͕??????v?f?Ƃ???????\???̑ΏۂƂȂ?܂??i??????ȊO?ɂ?object?^?ƂȂ?l?????邱?Ƃɂ͒??ӂ??Ă????????j?B???s???ʂ͎??̂悤?ɂȂ?܂????B
?@????͕??????v?f?Ƃ???????ΏۂƂȂ??Ă??܂??B?????āA?????????ꍇ?ɂ͗v?f???icount?j?A???j?[?N?ȗv?f?̐??iunique?j?A?ŕp?l?itop?j?A?ŕp?l?̓o??ifreq?j???\??????Ă???_?ɒ??ڂ??Ă????????B
?@?Ⴆ?A'Sex'???????ƁA???j?[?N?Ȓl??2??ށi???炭??male??female?j?A?ŕp?l??male?ł??̓o???577?ł??邱?Ƃ???A??q?Ə?g???̑??????j???ł????????Ƃ???????܂??B
?@?????????_???ƕ??????v?f?Ƃ??????w?肷??ɂ͎??̂悤?ɂ??܂??B
df.describe(include=['float', 'object'])
?@???̏ꍇ?͎??̂悤?ɁA?e??ɂ??ĕ\???ł??Ȃ????̂?NaN???\??????܂??i???e?I?ɂ͌???????K?v?͂Ȃ??ł??傤?j?B
?@?Ȃ??Ainclude?p?????[?^?[?ɂ?'all'???w?肷?邱?Ƃ??\?ł??B???̏ꍇ?͑S?Ă̗\???̑ΏۂƂȂ?܂??B
?@exclude?p?????[?^?[??'all'?̎w?肪?ł??Ȃ????Ƃ??????Ainclude?p?????[?^?[?Ɠ??l?Ɏw?肵?܂??B???????A??????͓??v?ʂ̕\???Ώۂ??珜?O?????̎w??ł??B?f?t?H???g?l?͏??O??????̂͂Ȃ??A?ƂȂ?܂????Ainclude?p?????[?^?[??exclude?p?????[?^?[???w?肵?Ȃ??ꍇ?ɂ́A???l?̗???ΏۂƂ???iinclude?p?????[?^?[?̃f?t?H???g?j?Ə??O??????̂͂Ȃ??iexclude?p?????[?^?[?̃f?t?H???g?j???g?ݍ??킳???āA???ʁA???l?̗????ΏۂƂȂ?܂??Bexclude?p?????[?^?[?ɉ??????w?肵???Ƃ??ɂ́A????ȊO?̗S?đΏۂƂȂ?_?ɂ͒??ӂ??Ă????????B
?@?Ō?ɂ????̃p?????[?^?[?̎w????@?ɂ͑??ɂ????낢??ȃo???G?[?V??????????܂??B?????ɂ??Ă?pandas?̃h?L???????g?upandas.DataFrame.describe?v???Q?l?ɂ??Ă????????B
?^?C?^?j?b?N?f?[?^?Z?b?g?????????????????ׂĂ݂?
?@???݂̃^?C?^?j?b?N?f?[?^?Z?b?g?ɂ͕???????܂ޗ?????݂????܂܂ł????A????ł???q?Ə?g???̓????Ɛ????ׂ邱?Ƃ͉\?ł??B??????Ƃ???Ă݂܂??傤?B
?@?܂??v???????Ԃ̂́A?????ƒj???ƂŐ??????ɍ??͂???̂??ǂ????A?ł?*1?B
tmp0 = df[(df['Sex'] == 'female') & (df['Survived'] == 0)]
tmp1 = df[(df['Sex'] == 'female') & (df['Survived'] == 1)]
tmp2 = df[(df['Sex'] == 'male') & (df['Survived'] == 0)]
tmp3 = df[(df['Sex'] == 'male') & (df['Survived'] == 1)]
survived_rate_f = len(tmp1) / (len(tmp0) + len(tmp1))
survived_rate_m = len(tmp3) / (len(tmp2) + len(tmp3))
print(survived_rate_f, survived_rate_m)
?@?ڂ????????͂??܂??A4?̕ϐ??ɑ?????Ă???̂?df?̗v?f?̒???'Sex'??̒l??'female'??'male'???A??????'Survived'??̒l??0??1???Ƃ????????????ɍ??v???Ă???v?f?ł??B???̐?????????????A?????҂̐???S?̂̐??Ŋ???ΐ?????????????Ƃ????킯?ł??B
?@???s???ʂ͈ȉ??̒ʂ?ł??B
?@?j???????????̕??????|?I?ɍ??????????ɂȂ??Ă??邱?Ƃ???????܂????B
*1?@???ۂɂ́A?????pandas.DataFrame.groupby???\?b?h???g???Ă????ƃV???v???ɏ????܂??i???A?܂?groupby???\?b?h???Љ?Ă??Ȃ??̂ŁA??ł͖ʓ|?ȃR?[?h?ɂȂ??Ă??܂??j?B
tmp = df.groupby('Sex')['Survived'].value_counts(normalize=True)
tmp
?@???l?ɁA???q?N???X?i?fPclass?f??j?Ɛ??????̊֘A?????Ă݂܂????i?R?[?h?͉??̉摜???Q?ƁB????Ă??邱?Ƃ͏?Ɠ????Bgroupby???\?b?h???g?????R?[?h?̎??s???ʂ??f?ځj?B
?@??????????Ă????q?N???X???悢?قǁA???????͍????Ȃ??Ă??܂??B
?@?Ō?ɗ??q?N???X?Ɛ??ʂ????????ɂǂ??W???Ă??邩?????Ă݂܂??傤?i?????ł?groupby???\?b?h?Ōv?Z????R?[?h?݂̂??摜?Ɍf?ځj?B
?@???q?N???X?̍??Ɛ??ʂ̍??ŁA????قǂ܂łɐ??????ɍ???????Ƃ͕M?҂??v???Ă??܂???ł????B?ł??A?????????????l??D??I?ɏ????悤?A??????D??I?ɏ????悤?A?ƍl???Ă̏?g???̍s???Ȃ̂???????܂???ˁB
?@???̂悤?Ɉꌩ????ƒP?Ȃ?f?[?^?̗???ł????Ȃ??f?[?^?Z?b?g?ł??A????f?[?^?Ƃ???f?[?^?̊֘A???????o???Ă??????ƂŁA?????̏o?????i???̏ꍇ?̓^?C?^?j?b?N?̎??̂ŋN???????Ɓj?̔w?i???????яオ?邱?Ƃ?????܂??B???̂悤?ɒT???I?Ƀf?[?^???͂??Ă??????ƂŁA???傫?Ȑ^?????????Ă??邩??????܂???i?z???g???ȁH?j?B
?@????͌????l??????i'Cabin'??Ȃǁj???ǂ?????悢???A???????v?f?Ƃ??????ǂ?????悢???ɒ??ڂ???DataFrame?I?u?W?F?N?g?Ɏ???????Ă????\??ł??B
AI?E?f?[?^?T?C?G???X?̊w?т?????????
???S?Ҍ????A?f?[?^???́EAI?E?@?B?w?K?EPython?̕????@?@??IT??Deep Insider?Ŋw?ڂ?
Copyright© Digital Advantage Corp. All Rights Reserved.
?A?C?e?B???f?B?A????̂??m?点
??IT eBook
RSS?ɂ???
?A?C?e?B???f?B?AID?ɂ???
???[???}?K?W???o?^
??IT?̃??[???}?K?W???́A ???????A???ׂĖ????ł??B???Ѓ??[???}?K?W???????w?ǂ????????B
ITmedia?̓A?C?e?B???f?B?A??????Ђ̓o?^???W?ł??B
???f?B?A?ꗗ | ????SNS | ?L???ē? | ???₢???킹 | ?v???C?o?V?[?|???V?[ | RSS | ?^?c??? | ?̗p??? | ??????