Pandas method corr() use not all features

2024/11/8 4:32:01

I have dataframe with shape (335539, 26). So I have 26 features. But when i use

data.corr() 

I get a 12 x 12 matrix.

What can be wrong? `

Answer

Pearson co-relation can only be used with continuous data. There is no point of changing the categorical features to numerate between 1 to n for various reasons. You can change them to numerical using one hot encoding technique or dummy variables technique. It is not clear as to between what type of data features you are trying to find a co-relation. If you are trying to find co-relation between nominal variable and continuous variable, it is better called measure of association and you can calculate that using ANOVA which has built in implementation in scipy library. If its between ordinal variable and continuous variable you can use Spearman's co-relation method.

If still you want to find co-relation using corr() try converting your data with the above methods I mentioned, although I am not sure if you will get correct results.

Its better to first formulate your question properly and then look for the specific test which support your sample space.

corr() takes only numerical data and thus you only find the co-relation between your numerical features.

https://en.xdnf.cn/q/120657.html

Related Q&A

int to datetime in Python [duplicate]

This question already has answers here:Convert string "Jun 1 2005 1:33PM" into datetime(26 answers)Parsing datetime in Python..?(2 answers)Closed 5 years ago.Im receiving data from the port.…

How to extract the historical tweets from twitter API? [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.Want to improve this question? Update the question so it focuses on one problem only by editing this post.Closed 5…

ValueError: invalid literal for int() with base 16: [closed]

Closed. This question needs debugging details. It is not currently accepting answers.Edit the question to include desired behavior, a specific problem or error, and the shortest code necessary to repro…

Modify list in Python [closed]

This question is unlikely to help any future visitors; it is only relevant to a small geographic area, a specific moment in time, or an extraordinarily narrow situation that is not generally applicable…

How to delete files inside hidden folder with python? [duplicate]

This question already has answers here:How can I delete a file or folder in Python?(18 answers)Closed 5 years ago.I want do delete a file, for ex. myfile.txt that is stored under a hidden folder. Is i…

Matching keywords in a dictionary to a list in Python

The following dictionary gives the word and its value:keywords = {alone: 1, amazed: 10, amazing: 10, bad: 1, best: 10, better: 7, excellent: 10, excited: 10, excite: 10}Following the dictionary are two…

How to make case insensitive? [closed]

Its difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying thi…

Webpage data extraction automation using python and selenium webdriver [closed]

Its difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying thi…

Asking the user if they want to play again [duplicate]

This question already has answers here:Ask the user if they want to repeat the same task again(2 answers)Closed 5 years ago.Basically its a guessing game and I have literally all the code except for th…

IF statement (checking for a string in a list) behave weirdly [duplicate]

This question already has answers here:How to test multiple variables for equality against a single value?(31 answers)Closed 10 years ago.This will probably be a dumb question, but why does this piece…