Python : T test ind looping over columns of df

2024/9/19 3:24:36

My dataframe is composed of accounting variables and a dummy variable that allows me to identify two types of company. I would like to perform a t-test for every column of my dataframe in order to compare the means of the variables between the two types of company.

For the moment I have separated my df into two different df based on the dummy variable and run the following code:

for column_type1, column_type2 in zip(df_type1.columns[1:],df_type2.columns[1:]):print(ttest_ind(column_type1,column_type2, equal_var=False, nan_policy='omit'))

However, I'm getting the following error:

TypeError: cannot perform reduce with flexible type

If you know how to solve this or have a better way to do it your help is more than welcome!

Thanks

**** EDIT & SOLUTION ****

I've come along my issue and here the code for it.

for column_type1, column_type2 in zip(df_type1,df_type2):print(ttest_ind(df_type1[column_type1],df_type2[column_type2], equal_var=False, nan_policy='omit'))
Answer
for column_type1, column_type2 in zip(df_type1,df_type2):
print(ttest_ind(df_type1[column_type1],df_type2[column_type2], equal_var=False, nan_policy='omit'))
https://en.xdnf.cn/q/119419.html

Related Q&A

I want to understand which line of code outputs **none** in the function

The last line of the output is none can someone explain why pls def just_lyrics():print ("i am a bad coder")print (" i keep trying to learn everday")def double_lyrics():just_lyrics(…

How to clear python console (i.e. Ctrl+L command line equivalent)

OS = Linux[boris@E7440-DELL ~]$ uname -a Linux E7440-DELL 3.17.4-200.fc20.x86_64 #1 SMP Fri Nov 21 23:26:41 UTC 2014 x86_64 x86_64 x86_64 GNU/LinuxFrom python console (Spyder 2.2.4, Python 2.7.5 64bits…

Floating point accuracy in python

Any reason why c shouldnt equal 0.321?>>> from math import ceil >>> a = 123.321 >>> b = a % 60 >>> b 3.320999999999998 >>> ceil(b) 4.0 >>> c = cei…

datetime64 comparison in dataframes

I am struggling with datetime64 comparisons in dataframes to update a column. lets say we have a dataframe df with a date columndf.date.values[0] Out[128]: numpy.datetime64(2015-05-17T22:00:00.00000000…

Relative import of a apackage in python flask application

Trying to make the sample flask application more modular,I am new to python and flask trying to build a sample application where , I have planned to maintain the folder structure of the application a…

Same sparql not returning same results

Im using the same sparql statement using two different clients but both are not returning the same results. The owl file is in rdf syntax and can be accessed here. This is the sparql statement: PREFIX …

Accessing nested values in nested dictionaries in Python 3.3

Im writing in Python 3.3. I have a set of nested dictionaries (shown below) and am trying to search using a key at the lowest level and return each of the values that correspond to the second level. Pa…

scrape site with anti forgery token

Im trying to scrape data from website that uses anti forgery token what i tried to do is sending a get request then finding the key and use it to send a post request i was able to successfully scrape t…

Pandas merge and grouby

I have 2 pandas dataframes which looks like below. Data Frame 1: Section Chainage Frame R125R002 10.133 1 R125R002 10.138 2 R125R002 10.143 3 R125R002 10.148 4 R125R002 …

Find a pattern in the line of another file in python [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.Want to improve this question? Update the question so it focuses on one problem only by editing this post.Closed 3…