BUG: output formatting with to_html(), index=False and/or index_names=False (#22579, #22747) #22655

simonjayhawkins · 2018-09-10T03:35:05Z

closes Column Offset Bug with to_html(index=False) with MultiIndex Columns and Index #22579
closes Columns Index Name with to_html(index_names=False) is displayed. #22747
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

pep8speaks · 2018-09-10T03:35:07Z

Hello @simonjayhawkins! Thanks for updating the PR.

Cheers ! There are no PEP8 issues in this Pull Request. 🍻

Comment last updated on December 28, 2018 at 15:37 Hours UTC

codecov · 2018-09-10T12:27:35Z

Codecov Report

Merging #22655 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #22655      +/-   ##
==========================================
+ Coverage   92.17%   92.18%   +<.01%     
==========================================
  Files         169      169              
  Lines       50708    50697      -11     
==========================================
- Hits        46740    46734       -6     
+ Misses       3968     3963       -5

Flag	Coverage Δ
#multiple	`90.59% <100%> (ø)`	⬆️
#single	`42.36% <0%> (ø)`	⬆️

Impacted Files	Coverage Δ
pandas/io/formats/html.py	`91.96% <100%> (+1.27%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 0976e12...6b441df. Read the comment docs.

codecov · 2018-09-10T12:27:36Z

Codecov Report

Merging #22655 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #22655      +/-   ##
==========================================
+ Coverage   92.29%   92.29%   +<.01%     
==========================================
  Files         163      163              
  Lines       51948    51956       +8     
==========================================
+ Hits        47945    47953       +8     
  Misses       4003     4003

Flag	Coverage Δ
#multiple	`90.7% <100%> (ø)`	⬆️
#single	`42.99% <0%> (-0.01%)`	⬇️

Impacted Files	Coverage Δ
pandas/io/formats/html.py	`98.67% <100%> (+0.03%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ab55d05...5b635e4. Read the comment docs.

pandas/tests/io/formats/test_to_html.py

simonjayhawkins · 2018-12-10T12:06:52Z

@WillAyd @jreback comments addressed. ptal.

WillAyd · 2018-12-14T04:46:41Z

pandas/io/formats/html.py

+        # Determine if ANY column names need to be displayed
+        # since if the row index is not displayed a column of
+        # blank cells need to be included before the DataFrame values.
+        self.show_col_idx_names = all((self.fmt.has_column_names,


Sorry for the questions, but I'm still trying to wrap my head around the implementation. Based on the comment, why is this all here and not any? Wouldn't any of these require there to be a cell where a column index name would be placed?

simonjayhawkins · 2018-12-14

Consider index=False with a single-level row index and a multi-level columns index where some, but not all, of the levels are named:

import numpy as np
import pandas as pd

index = pd.MultiIndex.from_product([['a', 'b'], ['c', 'd'], ['e', 'f']],
                                   names=['foo', None, 'baz'])
df = pd.DataFrame(np.arange(64).reshape(8, 8), columns=index)
result = df.to_html(max_rows=4, max_cols=4, index=False)
print(result)

foo  a  ...  b
c  ...  d
baz  e  f  ...  e  f
0  1  6  7
8  9  14  15
48  49  54  55
56  57  62  63

Note: missing truncation indicators in data now fixed in master.

The misalignment of the column names arises because the check is applied per level, inside the loop that generates the header levels:

pandas/pandas/io/formats/html.py

Lines 270 to 275 in d43ac97

name = self.columns.names[lnum]
row = [''] * (row_levels - 1) + ['' if name is None else
                                 pprint_thing(name)]

if row == [""] and self.fmt.index is False:
    row = []

Hence a class-level variable is needed to check whether ANY names need to be displayed, since that determines the alignment of every header row.
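The effect of the per-level check can be reproduced without pandas. This is a minimal sketch, not the pandas implementation: `leading_cells_per_level` and `leading_cells_class_level` are hypothetical helpers, `str` stands in for `pprint_thing`, and the second function only illustrates one way a single class-level decision could be applied.

```python
def leading_cells_per_level(names, row_levels=1, index=True):
    """Mimic the quoted loop body: the decision is made separately per level."""
    rows = []
    for name in names:
        row = [''] * (row_levels - 1) + ['' if name is None else str(name)]
        # An unnamed level yields [''], which is dropped when index=False,
        # so only that header row loses its leading cell.
        if row == [''] and index is False:
            row = []
        rows.append(row)
    return rows


def leading_cells_class_level(names, row_levels=1, index=True):
    """Assumed fix: decide once whether a name cell is needed, for all levels."""
    show_names = any(name is not None for name in names)
    rows = []
    for name in names:
        row = [''] * (row_levels - 1)
        if index or show_names:
            row.append('' if name is None else str(name))
        rows.append(row)
    return rows


names = ['foo', None, 'baz']
print(leading_cells_per_level(names, index=False))    # [['foo'], [], ['baz']]
print(leading_cells_class_level(names, index=False))  # [['foo'], [''], ['baz']]
```

In the first result the middle header row has no leading cell, which is what shifts 'c' under 'foo' in the rendered table above.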

The ALL condition determines whether ANY names should be displayed given the to_html parameters, and uses logic similar to that of to_string etc.

pandas/pandas/io/formats/format.py

Lines 796 to 803 in d43ac97

def _get_formatted_index(self, frame):
    # Note: this is only used by to_string() and to_latex(), not by
    # to_html().
    index = frame.index
    columns = frame.columns

    show_index_names = self.show_index_names and self.has_index_names
    show_col_names = (self.show_index_names and self.has_column_names)

and to the handling of the rows in to_html:

pandas/pandas/io/formats/html.py

Lines 307 to 309 in d43ac97

if all((self.fmt.has_index_names,
        self.fmt.index,
        self.fmt.show_index_names)):
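To make the any/all distinction concrete: "any" answers whether the columns carry at least one name, while "all" combines that answer with the display flags. The following is a minimal sketch under two assumptions: that has_column_names means "at least one column level name is not None", and that only the two conditions from the to_string() snippet apply. The function names are hypothetical, and the actual condition in html.py may include further flags.

```python
def has_column_names(column_names):
    # ANY: at least one level of the columns index is named (assumed meaning).
    return any(name is not None for name in column_names)


def show_col_names(column_names, show_index_names=True):
    # ALL: names are shown only if they exist and index_names was not disabled.
    return all((has_column_names(column_names), show_index_names))


print(show_col_names(['foo', None, 'baz']))                          # True
print(show_col_names(['foo', None, 'baz'], show_index_names=False))  # False
print(show_col_names([None, None]))                                  # False
```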

There is currently no test that explicitly covers this example, so I think the best way forward is to fully parametrize the truncation tests, in line with the parametrized basic_alignment tests, for added assurance.

I'll make show_col_idx_names a class property for clarity and add a note to refactor and 'inherit' from the DataFrameFormatter class. 'Inherit' is quoted because the HTMLFormatter class does not directly inherit from DataFrameFormatter. In the first refactor, just use mock inheritance like:

pandas/pandas/io/formats/html.py

Lines 46 to 48 in d43ac97

@property
def is_truncated(self):
    return self.fmt.is_truncated
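The "mock inheritance" pattern can be sketched in isolation as delegation through properties. Both classes below are hypothetical stand-ins, not the pandas classes, and the body of show_col_idx_names uses only the two flags named in this thread as an assumption; the real condition may differ.

```python
class FormatterSketch:
    """Hypothetical stand-in for DataFrameFormatter: owns the display state."""

    def __init__(self, is_truncated, has_column_names, show_index_names):
        self.is_truncated = is_truncated
        self.has_column_names = has_column_names
        self.show_index_names = show_index_names


class HTMLFormatterSketch:
    """Hypothetical stand-in for HTMLFormatter: wraps a formatter, no subclassing."""

    def __init__(self, formatter):
        self.fmt = formatter

    @property
    def is_truncated(self):
        # Pure delegation, as in the quoted snippet.
        return self.fmt.is_truncated

    @property
    def show_col_idx_names(self):
        # Derived from the wrapped formatter's flags in one place, so every
        # writer method sees the same answer (assumed subset of conditions).
        return all((self.fmt.has_column_names, self.fmt.show_index_names))


html = HTMLFormatterSketch(FormatterSketch(True, True, False))
print(html.is_truncated)        # True
print(html.show_col_idx_names)  # False
```

Exposing the value as a property keeps the header and row writers from recomputing it with slightly different logic.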

simonjayhawkins · 2018-12-14T17:53:22Z

@jreback @WillAyd with the additional parametrization of the truncation tests, we now have test coverage for multi-indexes with more than 2 rows, missing column index names, and truncation with standard row indexes. This coverage allows row_levels to be refactored into a class property in this PR, for use by _write_header and _write_regular_rows. I've added a TODO in _write_hierarchical_rows to refactor after #22887 is fixed.
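A sketch of what such a parametrization could enumerate, assuming each combination maps to one expected-output case. The axes, the id format, and the kwargs below are hypothetical and are not taken from the pandas test suite; the resulting tuples could be fed to a parametrized test.

```python
from itertools import product

# Hypothetical axes; the real tests may vary different options.
INDEX = (True, False)
INDEX_NAMES = (True, False)
ROW_INDEX = ('standard', 'multi')
COLUMN_NAMES = ('named', 'partly_named', 'unnamed')


def truncation_cases():
    """Yield (case_id, to_html kwargs, frame description) for every combination."""
    for index, index_names, row_index, col_names in product(
            INDEX, INDEX_NAMES, ROW_INDEX, COLUMN_NAMES):
        case_id = 'index_{}_index_names_{}_rows_{}_cols_{}'.format(
            index, index_names, row_index, col_names)
        kwargs = {'index': index, 'index_names': index_names,
                  'max_rows': 4, 'max_cols': 4}
        frame = {'row_index': row_index, 'column_names': col_names}
        yield case_id, kwargs, frame


cases = list(truncation_cases())
print(len(cases))  # 24
```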

simonjayhawkins · 2018-12-28T10:53:13Z

@WillAyd @jreback Could you please take another look. Thanks.

jreback · 2018-12-28T14:44:50Z

I would make a sub-dir of data/html to hold all of this test data (and move the original .html files as well).

jreback · 2018-12-28T16:41:21Z

@WillAyd over to you

jreback · 2019-01-01T16:28:37Z

@WillAyd

WillAyd · 2019-01-01T16:35:24Z

Thanks @simonjayhawkins !

jreback · 2019-01-01T17:04:40Z

thanks @simonjayhawkins !

* upstream/master:
  BUG: output formatting with to_html(), index=False and/or index_names=False (pandas-dev#22579, pandas-dev#22747) (pandas-dev#22655)
  MAINT: Port _timelex in codebase (pandas-dev#24520)
  Implement unique+array parts of 24024 (pandas-dev#24527)
  Integer NA docs (pandas-dev#23617)

BUG: header alignment

6adc266

simonjayhawkins added 7 commits September 10, 2018 12:16

add test_to_html_index_name_single_index

a30da56

add placeholders for multiIndex tests

b444fa2

add regression test

c61ea4a

add regression test

863b6d6

add regression test

b44d4ff

add regression test

d5c37e3

prep test_to_html_index_name_multi_index_both_index_false

6b441df

simonjayhawkins added 18 commits September 10, 2018 15:59

add regression test

dcf74a5

add regression test

dd07605

add regression test

4605a4e

add regression test

dd825f3

add regression test

b7fe95c

add regression test

dec609d

add regression test

5b9bc6e

move regression test

a725108

remove duplicated regression tests

d7e8237

add regression test

47fd132

add regression test

4e48a32

add failing test

30ac94e

split and rename tests

5f6a8d1

fix failing test

1b4c5dc

add test (failing)

82d57eb

fix failing test

cfa7570

refactor

9f884fb

add regression test (failing)

25e8103

WillAyd requested changes Sep 11, 2018

pandas/tests/io/formats/test_to_html.py (outdated, resolved)

simonjayhawkins added 3 commits December 9, 2018 20:47

Merge remote-tracking branch 'upstream/master' into issue22579

1febf76

additional test case for pandas-devgh-22783

6d60064

Merge remote-tracking branch 'upstream/master' into issue22579

1f61968

resolve merge conflicts

bd815e7

WillAyd reviewed Dec 14, 2018

simonjayhawkins added 7 commits December 14, 2018 12:36

add test case

7da52a1

make show_col_idx_names class property

0a0f82f

Merge remote-tracking branch 'upstream/master' into issue22579

bc7f8c7

parametrize truncation tests

8d2d68a

fix whitespace

d2e233e

Merge remote-tracking branch 'upstream/master' into issue22579

bdaa279

make row_levels class property

b7e4f54

simonjayhawkins added 2 commits December 28, 2018 15:26

Merge remote-tracking branch 'upstream/master' into issue22579

613ce00

move data files

5b635e4

jreback approved these changes Dec 28, 2018

jreback added this to the 0.24.0 milestone Dec 28, 2018

WillAyd approved these changes Jan 1, 2019

WillAyd merged commit b9284a2 into pandas-dev:master Jan 1, 2019

simonjayhawkins deleted the issue22579 branch January 1, 2019 21:00

Pingviinituutti pushed a commit to Pingviinituutti/pandas that referenced this pull request Feb 28, 2019

BUG: output formatting with to_html(), index=False and/or index_names…

aabed36

…=False (pandas-dev#22579, pandas-dev#22747) (pandas-dev#22655)

Pingviinituutti pushed a commit to Pingviinituutti/pandas that referenced this pull request Feb 28, 2019

BUG: output formatting with to_html(), index=False and/or index_names…

f95258e

…=False (pandas-dev#22579, pandas-dev#22747) (pandas-dev#22655)
