Package: repboxArt 0.1

Sebastian Kranz

repboxArt: Converting articles from PDF to text and managing and extracting basic information including tables

Converting articles from PDF to text and managing and extracting basic information including tables

Authors:Sebastian Kranz

repboxArt_0.1.tar.gz
repboxArt_0.1.zip(r-4.7-any)repboxArt_0.1.zip(r-4.6-any)repboxArt_0.1.zip(r-4.5-any)
repboxArt_0.1.tgz(r-4.6-any)repboxArt_0.1.tgz(r-4.5-any)
repboxArt_0.1.tar.gz(r-4.7-any)repboxArt_0.1.tar.gz(r-4.6-any)
repboxArt_0.1.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION
card.svg |card.png
repboxArt/json (API)

# Install 'repboxArt' in R:

install.packages('repboxArt', repos = c('https://repboxr.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/repboxr/repboxart/issues

On CRAN:

2.18 score 1 stars 1 packages 8 scripts 131 exports 26 dependencies

Last updated from:281611c890 (on main). Checks:7 WARNING, 2 OK. Indexed: yes.

Target	Result	Time	Files	Syslog
linux-devel-x86_64	WARNING	118
source / vignettes	OK	172
linux-release-x86_64	WARNING	174
macos-release-arm64	WARNING	82
macos-oldrel-arm64	WARNING	69
windows-devel	WARNING	75
windows-release	WARNING	68
windows-oldrel	WARNING	69
wasm-release	OK	98

Exports:activate_art_route art_ensure_correct_dirs art_extract_paren_type_from_tab_notes art_extract_pdf_raw_tabs art_extract_pdf_tabs art_extract_regstats art_get_html_files art_get_pdf_files art_has_html art_has_pdf art_has_two_col art_html_tab_standardize art_html_to_parts art_load_tab_df art_load_tabs art_load_text_parts art_load_txt_pages art_locate_col_refs art_locate_sentences art_locate_tab_fig_refs art_pdf_pages_to_parts art_pdf_to_txt_pages art_phrase_analysis art_refs_analysis art_reg_save_repdb art_reg_stats_phrases art_repair_two_col art_repair_two_col_aer_pandp art_save_basic_info art_save_repdb_tab art_tab_phrase_analysis art_tabs_to_regs art_text_parts_phrase_analysis art_update_project bind_rows_with_parent_fields cell_df_find_num_paren_pairs cell_df_to_tabhtml check_and_repair_footnote_candidates combine_short_paragraphs combine_text_lines ecta_parse_html ecta_parse_html_table ends.with.text ensure_empty_types example extract_all_to_index_df extract_num_from_sequence_text extract_order_num_from_sequence_text find_stars_str first_repair_art_pdf_text first.non.null from_to get_art_route get_art_route_dir get_art_tab_cell_with_reg_info get_phrases_def guess_journ html_tab_cell_row_panel_df html_table_cells_from_all_tr html_table_cells_from_tr identify_figure_lines_on_page is_aer_pandp is_really_a_note_line is.true jpe_parse_html jpe_parse_html_table keep.overlapping.loc left_join_overlap line_df_find_figures line_df_find_footnotes line_df_find_junk_lines line_df_find_page_header_footer line_df_find_section_cands line_df_find_sections line_df_to_parts_df lines_to_pages lines_to_plines load_art_route_parcels load_phrases_def loc_sep_lines loc_to_df locate_all_as_df make_art_small_reg make_phrases_def map_loc_to_parent_loc match_overlap most.common ms_parse_html ms_parse_html_table my_rank na.false na.remove na.val old.match_stat_to_reg_df pdf_to_txt_pages plines_to_lines readRDS.or.null refine_cell_df_and_add_panel_info remove_nested_html_elements remove.cols remove.overlapping.loc repbox_art_opts repbox_journ_list repbox.extract.pdf.images restat_parse_html restud_parse_html restud_parse_html_table rle_block rle_cummax_block rle_table route_art_tab_finish_route route_art_tab_set save_rds_create_dir sentences_merge_with_next seq_rows set_art_route show_cell_df_html substitute_wrong_pdf_txt_chars tab_df_to_cell_df tab_df_to_row_df tabname_to_tabid tabtitle_to_tabid text_df_add_section_cols text_df_standardize text_parts_tab_fig_references text_parts_to_loc txt_locate_keywords txt_locate_rx_keywords txt_locate_typed_keywords txt_phrase_analysis version_repbox_art

Dependencies:cli cpp11 data.table digest dplyr ExtractSciTab generics glue lifecycle magrittr pillar pkgconfig purrr R6 repboxUtils restorepoint rlang stringi stringr stringtools tibble tidyr tidyselect utf8 vctrs withr

Citation

Development and contributors

Readme and manuals

Help Manual

Help page	Topics

Usage by other packages (reverse dependencies)