Question 1

Why does Pandas coerce my numpy float32 to float64 in this piece of code:

>>> import pandas as pd
>>> import numpy as np
>>> df = pd.DataFrame([[1, 2, 'a'], [3, 4, 'b']], dtype=np.float32)
>>> A = df.ix[:, 0:1].values
>>> df.ix[:, 0:1] = A
>>> df[0].dtype
dtype('float64')

The behavior seems so odd to me that wonder if it is a bug. I am on Pandas version 0.17.1 (updated PyPI version) and I note there has been coercing bugs recently addressed, see https://github.com/pydata/pandas/issues/11847 . I haven't tried the piece of code with an updated GitHub master.

Is it a bug or do I misunderstand some "feature" in Pandas? If it is a feature, then how do I get around it?

(The coercing problem relates to a question I recently asked about the performance of Pandas assignments: Assignment of Pandas DataFrame with float32 and float64 slow)

Question 2

I think it is worth posting this as a GitHub issue. The behavior is certainly inconsistent.

The code takes a different branch based on whether the DataFrame is mixed-type or not (source).

In the mixed-type case the ndarray is converted to a Python list of float64 numbers and then converted back into float64 ndarray disregarding the DataFrame's dtypes information (function maybe_convert_objects()).
In the non-mixed-type case the DataFrame content is updated pretty much directly (source) and the DataFrame keeps its float32 dtypes.

Why does Pandas coerce my numpy float32 to float64?

Related Q&A

Conda and Python Modules

WeakValueDictionary retaining reference to object with no more strong references

Using pretrained glove word embedding with scikit-learn

Is there an easy way to tell how much time is spent waiting for the Python GIL?

Inverse filtering using Python

Quadruple Precision Eigenvalues, Eigenvectors and Matrix Logarithms

How to use pyinstaller with pipenv / pyenv

Sending DHCP Discover using python scapy

cnf argument for tkinter widgets

python exceptions.UnicodeDecodeError: ascii codec cant decode byte 0xa7 in