Commit graph

60 commits

Author SHA1 Message Date
3e782f20ee aaahhuuhuhuh why does our simple groupby stop working and why do more complicated ways work worse 2021-06-02 08:56:27 -04:00
aa05dc5734 Keep a version of harvest data around with same keys other stuff has, before renaming columns 2021-06-02 08:56:01 -04:00
0b61f165e3 Start conversion to Harvest column titles 2021-06-02 05:58:42 -04:00
6997b7ebea Define/redefine more project names 2021-06-02 05:36:41 -04:00
3cb7b495f5 Elegant merge operator, minus the typo 2021-06-01 20:48:57 -04:00
5ac674bf0d Give ourselves the debug variables we invariably create anyhow 2021-06-01 20:47:42 -04:00
0d2fe87111 Use elegant merge operator 2021-06-01 20:47:08 -04:00
020da15372 Fix crucial assignment error, we had only descriptions after this! 2021-06-01 20:33:48 -04:00
7387893e57 Gather projects into all three groups: harvest, known non-harvest, and unknown 2021-06-01 20:33:23 -04:00
379da43764 take one at collecting just known harvest stuff 2021-06-01 20:30:30 -04:00
ab951f5b86 Split out subproject 2021-06-01 16:43:57 -04:00
960008bdae Refine a couple more projects 2021-06-01 16:39:49 -04:00
eb3560bb3e Start to prepare for Harvest timelogs precise format export 2021-06-01 11:23:45 -04:00
91643f088d Add additional projects, rename as needed 2021-06-01 11:23:05 -04:00
20f31dfc6a Add more projects and name per Harvest 2021-06-01 11:16:06 -04:00
24156c83ec Sort projects in alphabetical order 2021-06-01 11:06:02 -04:00
3ed6413b25 Start adding projects exactly as named in Harvest 2021-06-01 11:03:45 -04:00
8002ddedea Fix another reversed debug logic 2021-05-29 04:34:08 -04:00
aff12ffaf2 Add additional aliases 2021-05-29 04:33:46 -04:00
e8ef6b0d7c Compare alternatives (plus original) as lower case so every possible variation need not be explicit 2021-05-28 14:39:51 -04:00
369d727d65 Add many more variations of camp support/defense 2021-05-28 14:39:09 -04:00
3eaa9afeb9 Add sub-project of NICHQ to resolve to Project: Subproject: approach 2021-05-28 14:38:47 -04:00
6c18d88dbc Add Family & Home to our projects 2021-05-28 14:38:09 -04:00
df64927f5a Force non-date values to 'not a time' that still let's column be datetime 2021-05-28 14:24:07 -04:00
d952f11350 Make one CSV logfile vs directory an option determined via settings.ini
Default to directory.
2021-05-28 14:21:52 -04:00
8c8c00586d Fix logic reversal error in our interactive/debug mode identification 2021-05-28 14:00:36 -04:00
afce4adaea Add a misspelling of propaganda site that came up 2021-05-27 08:20:14 -04:00
306b61055f Up our debugging game significantly by not making the rookie equals assignment object mistake 2021-05-27 08:19:56 -04:00
baee9b5087 Fix double typo mistakes 2021-05-26 12:48:36 -04:00
f6dadbfab0 Update glob to reflect directory path should have a slash already 2021-05-26 12:48:24 -04:00
1f4ec77467 Gather all csv files in provided directory 2021-05-25 15:57:52 -04:00
mlncn
ca5417ae93 Fix both filtering and an opposite logic error and our newest feature is done 2021-05-03 00:13:07 -04:00
mlncn
a1c0434015 Add missing import statement 2021-05-03 00:04:18 -04:00
mlncn
01c8670eb6 Finish up filtering of pre-last-recorded stuff 2021-05-03 00:01:59 -04:00
mlncn
206530b235 Use .apply rather than np.where for easier/better/not-randomly-failing individual parsing
+# On some systems, using np.where worked but others failed.  Why it worked is
+# unknown but why it failed is because numpy where evaluates all parts, even
+# the parts that will never get used because the where clause does not apply!
+# This caused the chained strings to fail because— no string.

This worked fine on the System76 and didn't on Bridget's computer, even after
updating the version of python, but anyway, .apply() is better for what i am
trying to do here.

TODO convert other np.where uses to .apply

See #4 in https://datatofish.com/if-condition-in-pandas-dataframe/
2021-05-02 22:38:54 -04:00
mlncn
d8a5e27a97 Save when our last processed date is from so we can pick up where we left off 2021-05-02 20:34:22 -04:00
mlncn
1f451309a7 Add requirements (python) file 2021-05-02 18:39:37 -04:00
benjamin melançon
3dc4d28f2f Add aliases for NICHQ 2021-04-30 09:19:10 -04:00
benjamin melançon
b8e4a8d094 Regularize Drutopia references 2021-04-28 11:42:00 -04:00
benjamin melançon
f706f0b1e6 Add comment about criticality of fill na 2021-04-28 11:15:53 -04:00
benjamin melançon
ce13e32f7d Regularize project titles 2021-04-28 11:14:39 -04:00
benjamin melançon
761d04204a Output all entries, reset index for fun, and write out our CSV file 2021-04-28 11:08:43 -04:00
benjamin melançon
9e97b6b250 Keep our strings from being ragged 2021-04-28 11:05:02 -04:00
benjamin melançon
9faa5c44c4 Document multiple task entries piece 2021-04-28 00:16:15 -04:00
benjamin melançon
367b69da3d Allow multiple entries to be put into one by splitting with semicolon 2021-04-28 00:14:48 -04:00
benjamin melançon
28550ef3ff Fix approach to dropping column after using it 2021-04-27 23:51:26 -04:00
benjamin melançon
4314ad0774 Really close to having it all now 2021-04-27 23:37:37 -04:00
benjamin melançon
1c63ab32e8 Actually we want to use regex to be sure we have a digit to multiply
Ensure we're splitting on the same asterisk we found with regex

We can tone down the regex now that we've thrown in extra cleanup,
that's coming next.
2021-04-27 22:40:17 -04:00
benjamin melançon
2516b9272d Document making .contains not a regular expression 2021-04-27 22:31:42 -04:00
benjamin melançon
f31c9d4d47 Start to use our multiplier to have time be x times what we give 2021-04-27 22:30:38 -04:00